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Preface 


Problems in the real world arise in many contexts. It could be: 

(i) to gain new understanding about some phenomena, for example how 
does a forest fire spread; 

(ii) to obtain the response behaviour of a system, such as the flight of a rocket; 

(iii) to design a complicated piece of equipment like a thermal power station; 
or 

(iv) to optimize some performance measure such as profits of a company. 

Although problem solving has been practised for a very long time, the use of 
mathematics as a very effective tool in problem solving has gained prominence 
in the last 50 years, mainly due to the rapid developments in computing. 

The critical step in the use of mathematics for solving real-world problems is 
the building of a suitable mathematical model. The importance of this is 
recognized in the fact that the past 20 years has seen the appearance of a large 
number of books on mathematical models and modelling, and several journals 
devoted to the topic. 

In the literature, models are often presented in their final, neat and elegant 
form. In reality there are many steps, choices and iterative processes that a 
modeller goes through in reaching a satisfactory model. These features are 
usually hidden, and hence the beginner fails to see them. Each step in the 
modelling process requires an understanding of a variety of concepts and 
techniques blended with a combination of critical and creative thinking, 
intuition and foresight, and decision making. This makes model building both 
a science and an art. 

This book advocates a novel approach to the teaching of the building 
process for mathematical models. The essential features of the approach are its 
scope, an emphasis on the art as well as the science aspects, and the 
development of a self-learning process by the student. 

The art aspect of modelling is difficult to teach in any formal sense. We have 
adopted a “learning through doing” approach somewhat akin to studio 
activities in, say, architecture. In this self-learning activity the instructor plays 

xiii 
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a critical role in guiding the student to appreciate the art aspect in an open and 
constructively critical environment. In addition to developing the various 
thinking and decision making skills, this format develops confidence in the 
student to solve new and unfamiliar problems. 

A special difficulty in the teaching of the science aspect is the breadth and 
depth of material needed for model building. The approach we have taken is as 
follows. We develop an overview of the modelling process to identify the 
different topics needed for model building, highlight the basic concepts and 
relevance of each topic and finally guide the student to learn the details 
through a self-learning approach by providing a rich bibliography. This not 
only achieves a sensible resolution of the conflict between breadth and depth 
but, more importantly, avoids the familiar trap where the student fails to 
differentiate the trees from the woods. 

In summary, the aims of the book are to develop in the student: 

(i) an appreciation of the total view of the modelling process from its logical 
starting point to the final point which yields the desired solution to the 
problem of interest; 

(ii) an understanding of the framework to carry out the modelling process; 

(iii) an understanding of the role and significance of the different topics 
relevant to the modelling process; 

(iv) an appreciation of the art and the science aspects of mathematical 
modelling; 

(v) the need for active self-learning. 

In addition, it aims to help the instructor discharge his role as a guide. 

The book is structured into the following three parts: 

Part I Methodology and Tools 

Part II Case Studies 

Part III Supplementary Material 

Part I consists of 13 chapters, and deals with the concepts and techniques 
needed for modelling. An outline of the different chapters is given in Chapter 1. 
Part II consists of three detailed case studies which bring all the material 
covered in Part I together in the context of model building. Part III consists of 
two chapters. The first deals with a large number of real-world problems suited 
for mathematical modelling which are meant for use as modelling exercises. 
The second is primarily meant for the instructor as a source for supplementary 
material. It contains a list of books on mathematical models and modelling, 
and a comprehensive list of articles on models, modelling and the teaching of 
modelling. 

Different levels of mathematical background are needed to appreciate and 
understand the different topics covered in Part I. It is doubtful if any 
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undergraduate programme would provide the ideal mathematical back¬ 
ground required. As a result, many topics will be new to the student. A word of 
caution is appropriate at this stage. Our approach to teaching mathematical 
modelling requires a broad overview of the modelling process which would be 
obscured to some extent if all details were provided. Consequently, the 
instructor will need to give the student an intuitive feel for any topic with which 
the student is unfamiliar, and should encourage the student to embark on a 
self-learning venture. Such a venture will not only cover any deficiency in the 
student's background but, more importantly, it will develop the self-learning 
and other skills needed for mathematical modelling. Notes at the end of each 
chapter give suitable references as starting points for such ventures. 

The book can be used as a text for teaching mathematical modelling at two 
different levels in an undergraduate programme. At the first level (suited for the 
2nd or 3rd year of an undergraduate programme in science or engineering) it 
can be used for an introductory course on deterministic modelling based on 
the following chapter sequence: 


Chapters 1-2-3 4 5-6 10 (part)-12 (part) 13. 

At the second level (suited for the 3rd or 4th year of an undergraduate 
programme in science or engineering) it can be used for an introductory course 
on stochastic modelling based on the following chapter sequence: 


Chapters 1 -2-3-4-7-8-9-10-11-12-13. 


At the postgraduate level this book can be used with Part III as the starting 
point and the student producing an end-product similar to that in Part II. This 
would require material from all earlier chapters, supplemented by additional 
material from the references at the end of each chapter. 

It is impossible to acknowledge individually the authors of scores of 
publications on mathematical modelling, and the teaching of mathematical 
modelling, that have influenced the authors over their long period of interest in 
this topic. We have attempted to make our reference list as comprehensive as 
possible. Any omission is unintentional and the authors apologize for it. 

Chapter 14 is heavily influenced by the work of Dr. Bailey. Table 14.1 and 
various parameter estimates are from his book The Biomathematics of 
Malaria , and we thank the publishers, Charles Griffin Company, for their 
permission to use them. Also we thank the World Health Organization for 
their permission to reproduce Figures 14.1 and 14.3, and related equations 
from their Bulletin. Chapter 15 is based on a paper by Drs. Martens and Bell. 
We would like to thank them, and the American Society of Mechanical 
Engineers, for their permission to make extensive use of the material in our 
chapter. Section 2 of Chapter 18 is based on a paper which appeared in 
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Mathematical Modelling , and the authors thank Pergamon Press for their 
permission to use the material. 

Many people read one or more chapters of earlier versions and gave helpful 
comments. In particular we would like to thank Drs. Peter Jacobs, Duncan 
Gilmore, Phil Diamond, Lou Westphal and Bill Daniel. The authors take full 
responsibility for the final version and any shortcomings. Istiana Djamaludin 
checked the reference lists with great care, and the authors are indebted to her 
for her services. 

Special thanks are due to the typists who were involved in the various 
versions. In particular, our thanks to Joan Krendlinger, Judy Wheelan and 
Kathryn Lancaster for their efforts. 

Finally, this book is dedicated to teachers all over the world who are 
involved in teaching mathematical modelling. 


PART I 

METHODOLOGY AND TOOLS 




CHAPTER 1 


Role of Mathematics in Problem 
Solving 


1 Introduction 

The use of mathematics in solving real-world problems (as distinct from 
artificial textbook problems) has become widespread in recent times. This is 
partly due to the use of the systems approach to problem solving, and to the 
increasing computational power of digital computers and computing metho¬ 
dology, both of which have made many more problems amenable to 
mathematical solution. There is hardly a branch of learning where mathema¬ 
tics and computing have not made an impact. 

The steps involved in using mathematics to solve real-world problems are 
shown in Figure 1.1. The most crucial and important step is the satisfactory 
translation of the problem from the real physical world into a mathematical 
description. Once this is done, standard techniques of mathematical analysis 
can be used to obtain a solution to the problem. The validity of the solution 
depends on how well the mathematical description models the real world. The 
mathematical description is called a mathematical model, and the process of 
obtaining it is called mathematical modelling. 

The use of mathematics is one of many approaches to solving real-world 
problems. Others include experimentation either with scaled physical models 
or with the real world directly. The mathematical approach has a number of 
advantages which can best be illustrated by considering a number of specific 
examples. 

Example / 

What is the thrust level and duration necessary to ensure that a rocket reaches the moon? In this 
problem a solution based on an experimental programme involving rocket launches with different 
thrust-time histories is obviously unacceptable due to the cost and the uncertainty of success. 
Furthermore, the solution to this problem cannot be obtained using scale-model experiments. 
This problem lends itself to a mathematical treatment. 

Example 2 

What are the consequences of changes to the operation of an existing chemical plant? For safety 
and cost reasons it would be undesirable to carry out the changes on the plant itself without first 


3 


< 
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MATHEMATICAL MODELLING 


Step-I 


Step-2 


Step-3 


Step-4 



Figure 1.1 Problem solving using mathematics. 


knowing the outcome. A scaled physical model could be used to obtain the desiredI mformation, 
but this would require special facilities. A mathematical approach would clearly be attractive here. 

Example 3 

Whal will be the growth in the number of airline passengers over the next 5 years? F ° r ^ sts ° f, ^“ 
nature are frequently needed for planning purposes. There is really no scientific alternative 
mathematical treatment for problems of this kind. 

In this book we concentrate on the process of mathematical modelling. Our 
aim is to develop this process in a framework which highlights all aspects of the 
modelling process. Failure to understand and appreciate the relevance of t ese 
different aspects will result in the creation of a mathematical model which is of 
limited use in solving the real-world problem at hand. 

In the remainder of this chapter we discuss the following topics: 

(i) nature of mathematical modelling, 

(ii) approach to the modelling process, 

(iii) an outline of the structure of the book. 

2 The Nature of Mathematical Modelling 

Mathematical modelling is the process by which a problem as it appears in the 
real world is interpreted and represented in terms of abstract symbols This 
abstract description involving a mathematical formulation is called a 
mathematical model of the original problem. In this abstract form the problem 
is divorced from the real world and can be treated in mathematical terms only. 
In other words the underlying formulation may be viewed as a dummy, much 
like a tailor’s dummy, and mathematical modelling as the process of clothing 
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the dummy in terms of the features of the real world. For example, a single 
mathematical formulation (dummy) may be appropriate for a number of 
physical problems with the mathematical symbols taking on different 
meanings (clothes) depending on the problem under consideration. Consider 
the following equation for one dependent variable (</>) in terms of two 
independent variables ( x,t ) and one parameter (a). 

d<p/dt = a d 2 <f>/dx 2 

This single mathematical formulation can be used to describe a number of 
quite different problems. For example, if t represents time and x one spatial 
coordinate, then (j) could be the temperature in a one-dimensional heat 
conduction problem, or species concentration in a one-dimensional diffusion 
problem. 

In order to carry out a mathematical modelling exercise one requires an 
understanding of many different topics. Some topics are needed for dealing 
with the problem as it appears in the real world and others for dealing with the 
formulation belonging to the abstract world of mathematics. In addition, one 
needs skills to relate these two distinctly different worlds - the real world of the 
problem and the abstract world of the mathematical formulation. 

Mathematical modelling is an art as well as a science. The science aspect 
deals with the topics needed to execute the various steps in the modelling 
process. Because no two problems are the same in the real world, features such 
as creativity, intuition and foresight also play an important role in the 
modelling process. These features constitute the art aspect of mathematical 
modelling. This makes mathematical modelling a very challenging, and at the 
same time a very demanding, activity. The science aspect can be appreciated in 
a passive learning mode, whereas the art aspect can be appreciated only by 
learning in an active mode - i.e. by building mathematical models and 
learning from experience. 

Seldom does one obtain an adequate mathematical model at the first 
attempt for the problem under consideration. In general an iterative procedure 
is needed where improvements are progressively made until an adequate 
mathematical model is obtained. The art aspect of modelling is very important 
during this iterative phase. 

3 Approach to the Modelling Process in this Book 

Many different approaches have been advocated for learning the mathemat¬ 
ical modelling process. This is reflected in the large number of books and 
journal articles that have appeared over the past decade on mathematical 
models and modelling, many of which are included in the references at the end 
of this chapter. A more comprehensive and annotated list is given in 
Chapter 17. 

This book employs an approach, the aims of which are as follows: 
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(1) The power of mathematical modelling as a tool for solving a wide variety of 
real-world problems is illustrated with many examples. 

(2) A universal and comprehensive framework is developed for model 
building. This framework identifies the main activities involved in model 

(3) lie important features of each of the main activities involved in modelling 
are discussed separately. Their use is illustrated through a series of case 
studies in which mathematical models are built for selected real-world 
problems. 

(4) All the different activities are brought together to illustrate the modelling 
process in its totality by considering three case studies in depth. 

(5) In the spirit of learning by participation, a number of exercises are given for 
solution by the student with the purpose of developing mathematical 
modelling skills. These exercises consist of a number of real-world 
problems. A brief description is given of each, together with a short list of 
references. 

An important feature of this approach is the extensive use of case studies to 
illustrate the various aspects of mathematical modelling. Twelve case studies 
have been used for this purpose. They have been chosen from a wide range of 
disciplines to encourage a broad understanding of the science and art aspects 
of modelling, and to demonstrate the universal nature of the modelling 

process. 

4 The Systems Approach 

The starting point in any problem-solving activity, whether it involves 
mathematical modelling or not, is to define clearly and unambiguously the 
problem to be solved. This aspect is discussed in Chapter 2 where the ideas are 
applied to the 12 case studies with a view to the subsequent development of 
mathematical models which will be used to obtain solutions. 

As will be seen from Chapter 2, problems typified by the case studies 
considered are all different in some respect. For the purpose of developing a 
methodology for mathematical modelling, it is essential to develop a universal 
framework that facilitates a common approach to all problems. The “systems 
approach” offers such a framework in which problems are viewed as being 
embodied in a system. By viewing the problem in this framework the 
characteristics of the system important to the problem can be clearly 
identified. This is the first step towards the development of a mathematical 
model. This process of “system characterization” is treated in Chapter 3. 

5 Mathematical Modelling and Structure of the Book 

Once the system characterization is completed, the next step is the construc¬ 
tion of the mathematical model. There are many different kinds of models, and 
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these are discussed in Chapter 4. A classification system for mathematical 
models is also given in that chapter, for the purpose of helping the reader make 
a better match between the system characterization and the mathematical 
formulation chosen for the model. 

An overview of various mathematical formulations and examples of their 
application are given in Chapters 5 and 7. This broad overview is intended to 
provide the reader with a knowledge of the variety of mathematical 
formulations available for a particular modelling problem. Notes and 
references are provided at the end of each chapter to help the reader find 
detailed information about selected formulations. This format is an important 
feature of the approach to teaching mathematical modelling in this book, as it 
leads to a unique combination of both breadth and depth. Chapters 6,8 and 9 
give an overview of mathematical tools available for carrying out analysis to 
obtain solution to the problem under consideration. Chapters 10-13 deal with 
steps needed to establish model credibility and its usefulness to yield a 
meaningful solution to the problem in question. Here again the emphasis is on 
giving a broad overview, with notes and references provided to give access to 
more detailed information. 

Chapters 3-12 cover separately the different topics needed for mathemat¬ 
ical modelling. These topics are brought together in Chapters 14-16, where the 
modelling process is discussed in its totality and illustrated by detailed 
application to three case studies. For each case study the complete iterative 
process used to obtain an adequate model for the specified problem is 
presented. 

Notes and References 

The use of mathematics and computers for solving real-world problems is very 
widespread. There is hardly a branch of learning where mathematics and 
computers have not made an impact. 

It is impossible to give a comprehensive list of books on the use of 
mathematics in different disciplines as it would run into hundreds. Instead, we 
give an illustrative sample: [l]-[4] deal with mathematics in biology and 
medicine; [5]-[8] deal with mathematics in social sciences; [9]-[12] deal with 
mathematics in physical sciences; [13]-[16] with mathematics in humanities, 
and [17] and [18] deal with mathematics in sports. 

Two introductory-level books which illustrate the effectiveness of mathema¬ 
tics in solving a wide variety of problems are [19] and [20]. A large number of 
interesting articles, at a fairly introductory level, on the application of 
mathematics to solving a diverse range of real-world problems can be found in 

[21]. . • U 

The activity of problem solving, though practised for a very long time, has 
received formal attention only over the past few decades. The number of books 
on problem solving is again fairly large, and [22]-[26] is a small sample. The 
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morphology of mathematical problem solving has received some attention 
and many alternative characterizations have been proposed. The one shown in 
Figure 1.1 is adequate for our purpose, and will form the basis for subsequent 

discussions. .... «. , 

Problem solving requires various types of thinking - creative, critica, 

deductive, inductive, scientific, intuitive, etc. Again, there are a large number of 
books dealing with different facets of this interesting topic. Items [27]-[30j 
form a small illustrative sample. It is recommended that advanced-level 
students in modelling read one or more of these, or similar, books. 


[1] Gold. H.J.: Mathematical Modeling of Biological Systems: An Introductory Guidebook. 

[2] Belhnan^R ^Mathematical Methods in Medicine. World Science Publishers. Singapore. 

[3] Townsend. J. T. and Ashby, F. G.: The Stochastic Modeling of Elementary Psychological 
Processes Cambridge University Press, Cambridge, 1983. 

[4] Eisen, M. M.: Mathematical Models in Cell Biology and Cancer Chemotherapy. Springer- 

[5] W«LB.^M a/heJncai Model as a Tool for the Social Sciences. Gordon and Breach. New 

[6] Bartholomew, D. J.: Stochastic Models for Social Processes. 2nd edn. Wiley, New York. 

[7] Rapoport, A.: Mathematical Models in the Social and Behavioral Sciences. Wiley, New York, 

rgl Gregory, G.: Mathematical Methods in Management. Wiley, New York, 1983. 

[9] Boas, M. L.: Methematical Methods in the Physical Sciences, -nd e n. tey, 

f jO] Tayler, A. B.: Mathematical Models in Applied Mechanics. Clarendon Press,' Oxford. 1986. 

[11] KoUegoda. N. T.: Stochastic Water Resources Technology. Halstcd Press, Wiley. New York. 

[12] WcUstead. P. E.: Introduction to Physical Modelling. Academic Press. New York lm 

[13] Renfrew C and Cooke. K. L.: Transformations: Mathematical Approaches to Cult 

[ 14 ] ^ZdTw^'E^ ^Language and Literature: A Statistical Approach. 

[15] Dora^J^E^d^u^som^F.^.^Mar/ieniatics and Computers in Archaeology. Harvard 

[16] Hage^P-'and"^^^^'^^^^^/Models in Anthropology. Cambridge University Press, 

[17] Ladany^S*p' and Machol. R. E. (eds): Optimal Strategies in Sports. Elsevier/North Holland, 

[18] M^cS a R , ’E 197 Ladany, S. P. and Morrison, D. G. (eds): Management Science in Sports. 

Elsevier/North Holland, Amsterdam, 1976. . Macmillan New 

[19] Noble, B.: Applications of Undergraduate Mathematics in Engineering. Macmillan, Ne 

[20] Burghes^D. N.. Huntley, 1. and McDonald, J.: Applying Mathematics: A Course in 
Mathematical Modelling. Ellis Horwood, Chichester, 1982. 

[21] The Bulletin of the Institute of Mathematics and its Applications. 1Q?5 

mi Rubinstein M F.: Patterns of Problem Solving. Prentice-Hall, Englewood Cliffs, NJ, 19 5. 

[23] Mayer, R.E.: Thinking and Problem Solving: An Introduction to Human Cognition and 
Learning. Scott, Foresman, Glenview, IL, 1977. 

[24] Kauffman, R. A.: Identifying and Solving Problem: A Systems Approach. University 
Associates, La Jolla, CA, 1976. 

[25] De Bono, E.: Problem Solving. Temple Smith, London, 1981. 
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[26] Robertshaw, J. E. et at.: Problem Solving - A Systems Approach. Petrocelli Books, New 
York, 1978. 

[27] Gilhooly, K. J.: Thinking: Directed. Undirected and Creative . Academic Press, London, 1982. 

[28] Organ. T. W.: Art of Critical Thinking. Houghton Mifflin. Boston, MA. 1965. 

[29] Hall, G. G.: The Application of Mathematical Thinking. University of Nottingham Press, 
Nottingham, 1963. 
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CHAPTER 2 


Problem Definition: 
The Starting Point 


1 Introduction 

The problem that needs to be solved must be clearly defined before any 
mathematical modelling can commence. In problems drawn from the real 
world this is done by describing the context of the problem and then stating the 
problem within this context. In this chapter the process of problem definition 
is illustrated in 12 case studies. These case studies are based on real-world 
problems that will be used throughout the book to teach the modelling 
process. They will be repeatedly referred to in later chapters to illustrate 
different aspects of the mathematical modelling process. 

2 Case Studies 

The 12 case studies are labelled A-L. As discussed in Chapter 1, they have 
been deliberately chosen to cover a wide range of subject matter. Table 2.1 
gives chapter and section numbers where each of the 12 case studies are used in 
the remainder of the book, to illustrate the role and significance of different 

topics in modelling. . . „ „ 

Although it is not essential to read all the case studies, and to follow these 
throughout the book, a better understanding and appreciation of the 
underlying universal framework used to develop mathematical models is best 
achieved by studying a broad sample of the case studies.* 

2.1 Case Study A ( Weight Lifting) 

In weight lifting competitions the lifters are grouped into seven different 
categories based on their body weights. There are two common lifting styles 
formalized in competition - jerk, in which the weight is lifted in two steps, one 

•Those reading the book for the first time are advised to read Case Studies A, D, and E ‘ftheyare 
interested in deterministic modelling, and Case Studies 1 and J if the mterest is in stochastic 
modelling. In subsequent readings they are urged to read the remaining case studies. 


Table 2.1 Section Index to Case Studies A-L 


Case Studies 

ABCDEFGHIJ K L 


Problem definition 
Chapter 2 

System characterization 
Chapter 3 


Mathematical models 
Chapter 4 

Deterministic models 
Chapter 5 


Stochastic models 
Chapter 7 

Model analysis 
Chapter 6 


Chapter 8 

Simulation 
Chapter 9 

Parameter estimation 
Chapter 10 

Design of experiment 
Chapter 11 
Validation 
Chapter 12 


2.1 

2.2 

2.3 

2.4 

2.5 

2.6 

2.2 

4.0 

6.3 

4.0 

3.0 

6.6 

6.1 

6.2 


5.0 

4.1 





6.4 

6.5 


3.2 


3.4 

5.4 

4.0 

4.5 

7.5 





5.5 






6.1 






7.4 


— 

4.2 

— 

— 

— 

— 



5.3 

4.2 

6.2 

8.2 




4.4 

7.2 





7.4 

8.5 


— 

— 

— 

7.6 

— 

— 

— 

— 

— 

5.3 

2.3 

— 

3.5 

— 

— 

2.2 

3.6 

— 

— 

2.4 

— 

— 

— 

— 

4.5 

_ 

_ 

3.1 

_ 

_ 


2.7 

2.8 

2.9 

2.10 

2.11 

2.12 

6.7 

2.1 

2.1 

6.10 

2.1 

2 2 


2.2 

3.0 


6.11 

4.1 


4.2 

5.0 



6.12 


6.8 

6.9 











5.3 

— 

6.7 

4.4 

6.5 

7.4 

_ 

_ 

8.4 

8.5 

_ 

_ 

4.4 


6.3 

3.2 

5.2 




8.4 

8.3 



_ 

_ 

_ 

_ 

3.3 

_ 





5.7 


4.8 

_ 

4.5 

4.2 

_ 

4.9 



4.7 




— 

5.4 

7.2 

— 

— 

— 

3.1 

_ 

_ 

5.6 

— 

— 


5.7 


Table 2.2 Data for Case Study A: Weight Lifts for Different 
Weight Categories 


Body weight class 

Winning weights 

Snap Jerk Total 

Flyweight (52)* 

105 

137 

242 

Bantamweight (56) 

117 

145 

262 

Featherweight (60) 

125 

160 

285 

Lightweight (67.5) 

135 

172 

307 

Middleweight (75) 

145 

190 

335 

Light-heavyweight (82.5) 

162 

202 

364 

Middle-heavyweight (90) 

170 

212 

382 

Heavyweight (100) 

175 

225 

400 


•Mean body weight in kilograms for the class 
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taking it to chest height and the second taking it to full height; and snatch, in 
which the weight is lifted to full height in one motion. Table 2.2 gives the 
different categories and world records of weight lifted in jerk and snatch 
motion for each of the seven categories. If there are enough lifters in each 
category, separate competitions can be run for each category. What contest is 
possible if there are too few competitors for separate competitions in eac 
category? What is the greatest athletic achievement of a group of weight lifters 

of widely differing weight? . . 

One solution to this problem would be a single competition for all weight 
categories with the winner decided using a handicapping rule designed to 
compensate for any body weight advantage. The problem is then to develop a 
meaningful handicap rule based on the data given in Table 2.2. 

2.2 Case Study B (A Hoy Selection) 

Titanium alloy is used in the manufacture of high-precision components for 
many industries. Alpha Inc., a company specializing in metallurgical pro¬ 
cesses, has decided to set up a plant to produce titanium alloy. The alloy can be 
produced by two different methods and Alpha Inc. has the option of choosing 
the ore from four different mines. Alpha Inc. is interested in producing the 
alloy with the smallest thermal coefficient. The problem facing Alpha Inc. is to 

decide: 

(i) on the method to be used, and 

(ii) on the mine from which it must get the ore. 

2.3 Case Study C ( Optimal Production ) 

Milk can be consumed either directly or in processed forms such as cream 
yoghurt and cheese. Beta Inc. is the sole producer of milk in a certain region and 
also owns the factory to process the milk into various forms. Beta Inc., like any 
other company, wants to maximize its total profit. The problem for the 
company is to determine the relative quantities of the various products 
necessary to achieve this goal. 

2.4 Case Study D (Rocket Launch) 

For meteorological purposes, rockets are launched so that they reach a certain 
altitude and record atmospheric conditions such as temperature and pressure. 
The problem here is to calculate the rocket thrust necessary to ensure that the 
rocket reaches the desired altitude. 

2.5 Case Study E (World Population) 

Many world organizations, for example the United Nations and its agencies, 
deal with global issues such as total food production and consumption ofnon- 
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Table 2.3 Data for Case Study £: Total 
World Population (1850-1980) 


Year 

Population (millions) 

1850 

1262 

1900 

1650 

1920 

1860 

1930 

2069 

1940 

2295 

1950 

2515 

1960 

2998 

1970 

3651 

1980 

4368 


renewable resources. Growth in the world population is important in all such 
issues. Table 2.3 gives statistics for the world population for the last 100 years. 
The problem here is to obtain estimates for the total population for the next 
100 years, with and without birth control policies. 

2.6 Case Study F (River Pollution) 

Gamma Inc. is a company which produces chemicals and is located on the 
banks of a river. In the past it has disposed of its waste by indiscriminately 
dumping it into the river, causing high levels of pollution. As a consequence, 
the local authorities passed new legislation with very high Fines if the pollution 
in the river exceeded certain specified concentration limits. The problem here 
is to find a policy for discharging the waste so as to ensure that the 
concentration levels never exceed the specified limits. 

2.7 Case Study G ( Weather Changes) 

The operators of a holiday resort complex have been using liquid fuel for 
heating their swimming pool so that the temperature of the pool is around 27 
degrees Celsius. Due to the rising cost of liquid fuel, they have decided to look 
at alternative methods of heating the pool. Solar energy offers one such 
alternative. However, 8 hours of sunshine per day are needed for the pool to be 
maintained at the desired temperature. On days when the sun is shining for less 
than 8 hours liquid fuel must still be used to make up for the lack of solar heat. 
The operators have collected information regarding sunshine over an interval 
of 63 days. The information is as shown in Table 2.4, where each day is 
classified as being sunny (if there was 8 or more hours of sunshine); partly 
sunny (if the sunshine was between 4 and 8 hours) or cloudy (if the sunshine 
was for less than 4 hours). The problem of interest to the operators is to decide 
whether it is economical to install the solar heating system. 
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Table 2.4 Data for Case Study C: Weather Changes in the Holiday Resort 


Day 



S: Sunny; P: partly sunny; C; cloudy. 


2.8 Case Study H (Thermal Power Station) 

A thermal power station is a large complex system which produces electricity 
by burning coal. Basically, the heat released by the burning coal is used to 
convert water into steam, which in turn drives a turbine to produce rotational 
motion. This motion causes a generator to convert the mechanical energy into 
electrical energy. Many problems arise in this context. A few are given 

below: 


(i) For economic planning: when should the next power station be built, how 
large should it be and where should it be put? 

(ii) For control purposes: how does the alternator or boiler respond to 

fluctuating load? 

(iii) For design purposes: for a given output what is the necessary size of boiler 
and superheater? 


2.9 Case Study 1 (Component Reliability) 

Large complex machines, such as computers or rockets, are composed of a 
large number of components. Components are prone to failure, and the time to 
failure (or the age at failure) is uncertain and unpredictable. Table 2.5 gives the 
age at failure for a sequence of failures of a control element for a large machine. 
The machine is inoperable without this element. Thus, whenever the element 


Tabll 2.5 Data for Case Study I: Age al Failure for Control Element 


( x 100 hours) 


18487 0.3761 0.7500 

1.8802 1 5700 1 7708 

| 7961 1.5319 0.5903 

1.6560 1 7172 19310 

1 3162 0.7705 1 8889 


3.0530 1.3545 

1.3592 3.0466 

0.6288 0.6461 

1.0509 1.6173 

1.7410 4.1515 
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fails, it is replaced by a new one so that the machine becomes operable. The 
problem is to evaluate: 

(i) how long the machine can be kept operational if there are n spares of the 
control element, 

(ii) how many replacement elements are needed to keep the machine 
operational for T units of time. 

2.10 Case Study J (Speed of Aeroplanes) 

In modern industrial society changes are taking place at a very rapid rate. One 
example of this is the increase in aircraft speed over the years. Table 2.6 gives 
the maximum speed of aircraft as a function of time from 1906 to 1965. 
Frequently military and civil aviation planners need to know what perfor¬ 
mance aircraft are likely to have at some future time, say in the year 2000. The 
problem here is to predict the likely maximum speed in the year 2000. 

2.11 Case Study K (Supermarket Operation) 

In large supermarkets there are usually a large number of normal, and a few 
fast, checkout counters. People with less than a specified number of items 
stand in line at fast checkout counters, while others go to normal checkout 
counters. Fewer counters imply long queue lengths and this can affect 
customer goodwill. On the other hand, if there are too many checkout 
counters, manning them reduces the total profit. The problem is to decide on 
the optimum number of checkout counters that the supermarket should have. 

2.12 Case Study L (Demand for Soft Drinks) 

Cool Inc. is a company which manufactures soft drinks. It is thinking of 
expanding its plant capacity so as to meet future demands. The monthly sales 
for the past 5 years are given in Table 2.7. The problem is to obtain good 
estimates for future demand so as to help the company make the right decision. 

3 Need for a Unified Approach 

As seen from the case studies of the previous section, each problem is distinct 
and differs from any other. A comprehensive framework, to treat such 
distinctly different problems in a unified manner, is highly desirable and indeed 
essential. The systems approach offers such a framework. 

In the systems approach, the real world associated with the problem is 
viewed as a system. Identifying the features of the system that are relevant for 
the problem is called system characterization, and is discussed in the next 
chapter. 
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Table 2.6 Data for Case Study J: Speed of Aeroplanes 


Date 


Nov. 12, 1906 
Oct. 26, 1907 
May 20, 1909 
Aug. 23, 1909 
Aug. 24, 1909 
Aug. 28, 1909 
Apr. 23, 1910 
July 10, 1910 
Oct. 29, 1910 
Apr. 12, 1911 
May 11, 1911 
June 12, 1911 
June 16, 1911 
June 21, 1911 
Jan. 13, 1912 
Feb. 22, 1912 
Feb. 29, 1912 
Mar. 1, 1912 
Mar. 2, 1912 
July 13, 1912 
Sept. 9, 1912 
June 17, 1913 
Sept. 27, 1913 
Sept. 29, 1913 
Feb. 7, 1920 
Feb. 28, 1920 
Oct. 9, 1920 
Oct. 10, 1920 
Oct. 20, 1920 
Nov. 4, 1920 
Dec. 12, 1920 
Sept. 26, 1921 
Sept. 21, 1922 
Oct. 13, 1922 
Feb. 15, 1923 
Mar. 29, 1923 
Nov. 2, 1923 
Nov. 4, 1923 
Dec. 11, 1924 
Sept. 5, 1932 
Sept. 4, 1933 
Dec. 25, 1934 
Sept. 13, 1935 
Nov. 11, 1937 
Mar. 30, 1939 
Apr. 26, 1939 
Nov. 7, 1945 
Sept. 7, 1946 
June 19, 1947 
Aug. 20, 1947 
Aug. 25, 1947 
Sept. 15, 1948 
Nov. 19, 1952 
July 16, 1953 


Country 


France 

France 

France 

France 

France 

France 

France 

France 

United States 

France 

France 

France 

France 

France 

France 

France 

France 

France 

France 

France 

United States 

France 

France 

France 

France 

France 

France 

France 

France 

France 

France 

France 

France 

United States 

France 

United States 

United States 

France 

France 

United States 

United States 

France 

United States 
Germany 
Germany 
Germany 
Great Britain 
Great Britain 
United States 
United States 
United States 
United States 
United States 
United States 


Speed (mph) 


25.66 

32.75 

34.06 

43.38 

46.18 
47.84 
48.21 

66.18 
68.20 
69.47 
74.42 

77.67 
80.91 

82.73 
90.20 
100.22 
100.94 

103.66 
104.33 
106.12 
108.18 

111.73 
119.24 

126.67 
171.04 

176.14 
181.86 
184.36 
187.98 
192.01 
194.52 
205.22 
211.90 

222.97 
233.01 
236.59 

259.15 
266.58 
278.48 
294.38 

304.98 
314.319 
352.388 
379.626 
463.917 
469.220 
606.255 
615.778 
623.738 
640.663 
650.796 
670.981 
698.505 
715.745 
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Table 2.6 {Contd.) 


Date 

Country 

Speed (mph) 

Sept. 7. 1953 

Great Britain 

727.624 

Sept. 25. 1953 

Great Britain 

735.702 

Oct. 3. 1953 

United Stales 

752.943 

Oct. 29. 1953 

United States 

755.149 

Aug. 20, 1955 

United States 

822.266 

Mar. 10. 1956 

Great Britain 

1132.136 

Dec. 12. 1957 

United States 

1207.6 

May 16. 1958 

United States 

1404.09 

Oct. 31. 1959 

U.S.S.R. 

1483.85 

Dec. 15. 1959 

United States 

1525.965 

Nov. 22. 1961 

United States 

1606.48 

July 7. 1962 

U.S.S.R. 

1665.9 

May 1. 1965 

United States 

2070.102 


Table 2.7 Data for Case Study L : Monthly Sales ( x 1000 litres) 


Year 


Month 

I9S3 

I9H4 

1985 

I9H6 

1987 

January 

629 

747 

874 

997 

1095 

Febuary 

642 

781 

896 

1008 

1102 

March 

693 

812 

936 

1044 

1146 

April 

694 

811 

952 

1054 

1154 

May 

682 

822 

932 

1043 

1138 

June 

686 

814 

929 

1040 

1142 

July 

689 

815 

938 

1062 

1139 

August 

718 

826 

940 

1048 

1172 

September 

743 

852 

944 

1076 

1168 

October 

755 

878 

999 

1092 

1148 

November 

767 

852 

994 

1082 

1170 

December 

739 

805 

986 

1124 

1192 


Notes and References 

The problems discussed in this chapter, though varied, do not cover all 
disciplines of learning. For example, literature, law, and health are a few not 
represented. A problem in literature where mathematics and computing has 
proved to be extremely useful is the following. Is an old text (e.g. a book of the 
Old Testament) written solely by a single author, or is it the work of multiple 
authors? Mathematical models can be built to answer such questions. 
Interested readers can find details of such models in [1] and [2]. 

Problems can be classified in two ways. Firstly, they can be classified by the 
discipline to which they belong for example, social science, biological 
science, technology and so on. Further subdivisions exist - for example, in the 
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case of technology it would be chemical engmeermg mechanical eng.neenng 
civil engineering and so on. Secondly, problems can be classified into different 
groups based on the nature of the problem - for example problem of analysis, 
of synthesis, of optimization, of forecasting, to name a ew. 

For additional problem definitions see Chapter 1 . 


[,] Brainerd. B, Weighing Evidence in Language and Li,era,are: A S,a,is,ica, Approach. 

[2] SSE En6,ewood c,ifrs - NJ> 1972 


CHAPTER 3 


System Characterization 


1 Introduction 

In the previous chapter we described a large number of real-world problems in 
the form of Case Studies A-L, and mentioned that the systems approach 
offered a universal framework for treating such distinctly different problems. 
In the systems approach the real world associated with the problem is viewed 
as a system, and the solution to the problem as a study of the system with a 
defined goal. The starting point for such a study is a proper description of the 
system. 

In order to describe the system, one needs a clear understanding of the terms 
and concepts used in the systems approach and the precise meaning attached 
to them. Also, any significant problem in its real-world form is usually very 
complex. Hence, a total description of it, when viewed as a system, becomes 
involved and unmanageable. Fortunately, not all features of the real world are 
relevant to the problem or its solution. Consequently, a partial description is 
often adequate. This partial description, called the “system characterization", 
retains only those features of the real world that are relevant for the problem to 
be solved. Thus, the system characterization phase, as shown schematically in 
Figure 3.1, can be viewed as a process of simplification, and idealization. 

The system characterization can be done in any natural language, for 
example English. However, alternative methods, such as schematic or graph- 
theoretic or matrix representation, are more compact and elegant represent¬ 
ations. As such they present the problem in a form that is easier to comprehend 
and facilitate the first conceptual steps towards mathematical model building. 

In this chapter we describe the process of system characterization. We 
discuss various terms and concepts used in the systems approach and make the 
meaning clear by applying them to Case Studies A L introduced in the 
previous chapter in the form of examples. 

2 Some Basic Concepts 

In this section we start by defining a system, and discuss some basic terms and 
concepts used in the systems approach. In subsequent sections we discuss 
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Figure 3.1 Systems approach to real-world problem solving. 


further concepts that play an important role in developing a system 
characterization. 

2.1 System / Variable! Parameter 

A “system” can be defined as a collection of one or more related objects. An 
object is a physical entity with specific characteristics or attributes. Theotje 
cai be either non-interacting or interacting in some sense. Of special interes 
are systems where there is interaction between the objects, since these most 

closely resemble the real world. _ „ „ j 

The attributes of an object are described in terms of parameters 
“variables”. Parameters are attributes intrinsic to an object. Variables are 
attributes needed to describe interaction between objects. Often, parameters 
and variables are terms from some theory or theories, if such exist, and are used 
to explain the state and behaviour of the system. When no such theory exists, 
variables and parameters are words from a natural language with the usual 
meaning. We will illustrate these points by a few examples. 


Example I ( Thermal Power Station: Case Study H ) 

magnetism. 


Example 2 (Supermarket Operation: Case Study K) 
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the shop, number of customers waiting al the counters, number of items purchased by each 
customer and so on. Two system parameters are (i) the price of each product and (ii) the number of 
counters. 


Example i (Component Reliability: Case Study I) 

The control element can be viewed as a single-object system. The variables describing the system 
would be the state of the element - i.e. working or failed, and the age of the element if it is working. 


2.2 System/Environment 

The way we have characterized a “system”, so far, implies that it is either an 
isolated single object or an isolated collection of objects. In reality this is 
seldom the case. The system under study is usually a subset of some bigger 
system - more appropriately labelled a “super-system”. This super-system is 
usually the observer and the universe known to him or her. 

The objects in the system under study might or might not interact with 
objects of the super-system which do not belong to the system. The system is 
said to be a “closed system” if there is no such interaction and an “open 
system” if there is. 

The “system environment” is the term used to describe those objects that are 
outside the system but interact with it. The interaction between the system and 
its environment is through variables common to both. This raises the natural 
question - When does an object belong to the system and when does it belong 
to its environment? If an object interacts with the system in the way described 
above, should it not be considered a part of the system, since the system and its 
environment make up the universe of interest. Subdivision of this into two 
parts - system and environment - is mainly for convenience and quite 
arbitrary, since the division can be done in many ways. We will illustrate this 
by a few examples. 


Example 4 (Thermal Power Station: Case Study H ) 

The thermal power station produces electricity from coal. The coal is obtained from mines. The 
generated electricity is fed into a distribution network which distributes the electricity to users. We 
can regard the coal mine as an object outside the system which affects the system. The network is 
regarded as another such object but, in this case, is affected by the system. The system becomes 
more complicated if we include these objects within its boundaries. 

Often diagrammatic displays are more effective than natural language in describing the system. 



Figure 3.2 System/environment interface for Example 4. 
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the environment. 


Example 5 ( Weight Lifting: Case Study A ) 

The real world of the problem can be viewed as a system with two objects - i.e. the weight lifter and 
It. • weiuhts for liftineThc variables describing the attributes of the objects arc the body weight of 
the Hftefanf the weight lifted. Since the interest is on weigh, lifted, we can view the system as a 

simplification mentioned in Section 3.1. 


Example 6 (Demand for Soft Drinks: Case Study L) 

When the real-world problem m this case study ,s viewed as a system, one ' -•ab^r importance is 

etr£S S'n JSStf-Stai the system as a closed system On the other 

hanil.Vfonetncludes variables such as population changes, weather conditions, advertising etc., 
then we view the system as an open one. as shown tn rigure 3.3 

These examples show particular choices of system boundary which are 
satisfactory for the problems in question. Other boundar.es could have been 
chosen but would have led to different characterizations. The selection of a 
system boundary is one component of the art aspect of mathema ica 
modelling there is no universally correct division between system and 
environment. Novel and useful characterizations can follow from different 
choices of system boundary. 


2.3 Relationships (Interaction between Objects) 

The interactions between objects are described through relationships linking 
the variables of the interacting objects. Of particular importance are cause 


Environment 



Figure 3.3 System environment interface for Example 6. 
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effect” interactions producing what are termed “causal relationships”. Such a 
relationship may be illustrated by the following well-known proverb: 

For want of a nail a shoe was lost; for want of a shoe a horse was lost; for 
want of a horse a rider was lost; for want of a rider a battalion was lost; for 
want of a battalion a battle was lost; for want of a victory a kingdom was 
lost - all for want of a nail. 

Causal relationships are best indicated by means of either graph-theoretic or 
matrix tabular displays, as shown by the following examples. 


Example 7 (Advertising) 

Companies use advertising to improve their sales, and hence revenue and profits. However, 
advertising is an expensive operation affecting company profit. The problem in advertising is to 
decide how much and what kind of advertising is justified. From the systems approach point of 
view the real world of the advertising problem can be viewed as a closed system with three 
variables - sales, profits and advertising expenditure. Advertising expenditure affects sales, which 
in turn affects profits. The profits, in turn, affect advertising expenditure. We can display the causal 
relationships, very elegantly and compactly, by using a graph-theoretic display, as shown in 
Figure 3.4. In this display nodes represent variables, and directed arcs indicate the causal 
relationships. The direction of the arrow points away from the variable which is the cause and 
towards the variable which is being influenced. We will find that graph-theoretic representations 
of this kind arc extremely useful in succinctly characterizing the system under study. 


Graph-theoretic display 



Matrix tabular display 



Advertising 

Sales 

Profits 

Advertising 

0 

1 

0 

Sales 

0 

0 

1 

Profits 

, 

0 

0 


Figure 3.4 Alternative displays for Example 7. 
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corresponds to no such causal relationship. 

In some systems (especially those associated with problems in social 
sciences) it i/not easy to identify and define causal relationships between 
variables. In some cases there can be more than one cause for an effect, as 
illustrated by the following example. 


Example 8 (Death of Pedestrian) .... . ., 

•« «%“»» T““ 'sjr' “ 12K2 . 

ri’h“u £ d. . ,o Ml Which cause „ causes ... -P»™~ **«*•- 
the goal or purpose of study. 


3 Degree of Detail 

Consider an open system that is described only by the variables through which 
it interacts with its environment, and the inner structure of the system is 
ignored. This description is called a “black-box” description. Such a situation 
occurs when one has very little understanding of the inner structure of 
system. Even when there is some knowledge of the inner structure, i issi 
sometimes convenient to view it as a “black box” for the purpose of deriving a 
simpler description of the system. In either case the inner details are being 
ignored. On the other hand, if one can describe all the objects in the system an 
their attributes (through variables and relationships) then one *s d^ n g the 
system in greater detail. In this case the system is being viewed as a whue box 
^ contrast to a "black box”). More appropriately, it should be called a 

* Thedegrw oTdetail needed to describe a system appropriately de P ends °" 
many factors. If all the details are included in the description il car. become 
unmanageable and hence of limited use. However, d s,gn “ deta ^ a ^ 
omitted the description is incomplete and, once again of limited use in 
carrying out the study. Hence, one needs to find a sensible compromise. Th 
seTrch for this compromise is a part of the “art aspect” of the modelling 
process. We illustrate these points by a few examples. 


Example 9 (Component Reliability: Case Study I) 
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failure is due to a crack in metal. This would result in a more detailed description with more 
variables and relationships. 


Example 10 (World Population: Case Study E) 

For this problem a system characterization which differentiates the population by (i) age. (ii) sex 
and (iii) geographic location contains more details of the real world than a description where all 
these are lumped together. 


4 Static vs Dynamic 

In static systems time does not play any part, and hence the variables and 
relationships describing the system are time-independent. In contrast, in 
dynamic systems, time plays a very important role with the variables and/or 
relationships describing the system changing with time. We illustrate by 
considering two examples. 

Example II (Alloy Selection: Case Study B) 

The problem can be described in terms of a closed system with three variables. 

A: coefficient of expansion for the alloy, 

B. method of production, 

C: supplier. 

The causal relationships between the variables are as shown in Figure 3.5. In this description time 
plays no role and hence the system is a static system. 


Example 12 (Rocket Launch: Case Study D) 

The problem can be described in terms of a closed system consisting of two objects - the rocket 
and the earth. The variables describing the rocket are its position and velocity relative to some 
fixed point on the earth, and the interaction between the two objects is given by the theory of 
dynamics. In this description we have excluded the influence of other planets in the solar system. If 
these influences are important we can include them by treating the planets as belonging to the 
environment of the system and the system being viewed as an open system (as opposed to a closed 
system). In either case the variables - i.e. position and velocity of the rocket - change cont¬ 
inuously with time. Hence the system is a dynamic system. 



Figure 3.5 Graph-theoretic display of causal relationship for Example 11. 
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In dynamic systems time enters into the cause -effect interactions. Let x and 
y represent two variables of a dynamic system with changes m x over time 
being the cause for changes in y over time. Since only the past can influence the 
present and the future, and not the reverse, >*(/) (the variable y at time t is 
affected only by x(r) with r < t and not by x(t) with x > t. This is called the 
“principle of causality”, which all physical systems obey. 


4.1 Continuous Time vs Discrete Time 

In dynamic systems either some or all of the variables are changing with time. 
One may ask whether the changes take place continuously with time or 
whether they occur at discrete instants of time. Experimental^ U is not 
possible to establish that a variable is changing continuously with time, since 
we are restricted by some finite resolution of time for each observation. Thus, 
one has to make an a priori decision whether to treat the time element as 
continuous or discrete. If the time element is treated as being continuous, then 
the variables of the system need to be described for all time instants over the 
time interval of interest. On the other hand if it is treated as being discrete, then 
the variables need to be described only for the relevant time instants. 

A continuous time description is too detailed in many instances. In sue 
situations one chooses a discrete time description by arbitrarily discretizing 
the time If the interval of discretization is large one obtains a coarse description. 
As this interval becomes smaller the description becomes more refined and 
detailed, and ultimately approaches the continuous time description. 

The decision to treat the time element as continuous or discrete is often a 
subjective one, and once again involves the art aspect of modelling. However, 
in certain instances the data available are such that a discrete treatment of time 
is the most appropriate decision. We illustrate this point by a few examples. 


Example 13 (Demand for Soft Drinks Case Study L) 

If one is interested in weekly demand, then the variables of the system characterization change 
weekly and hence the time element is to be treated as discrete as opposed to continuous^ 
Obviously, if the demand interval is monthly, then the interval of discretization is larger, resulting 
in an aggregate or coarser description. 


Example 14 (River Pollution: Case Study E) 

The river pollution problem is best described by treating the lime element as being 

with the variables of the system description (e.g. concentration level in the river at a given location) 

changing continuously with time. 


4.2 Time Scales in Dynamic Systems 

In dynamic systems the variables change with time. The term “time scale” is 
used to indicate the duration for significant changes to occur in a variable. 
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Table 3.1 Time Scales for Example 16 

Time scale of interest Relevant theory for interaction 


r 0 = r 0 /u 

(Collision time scale) 


t -X/u 

(Kinetic time scale) 

T = L/u 

(Macroscopic time scale) 


Particle dynamics 
(Deterministic and reversible) 

Kinetic theory 
(Statistical and reversible) 
Continuum theory 

(Field theory and phenomenological) 


Often, when variables change at different rates, there can be more than one 
time scale. 

If the time interval of study is T, then variables with time scales very much 
larger than T can be approximated as not changing over the time interval of 
study, thus reducing the number of dynamic variables needed in the 
characterization. Variables with time scales smaller than T, or comparable to 
T, need to be described for all time instants as they change significantly over 
the time interval of study. Identifying the different time scales is a part of 
system characterization. 


Example 15 (Thermal Power Station: Case Study H) 

Suppose that the problem is to design an efficient boiler. In the systems approach the boiler can be 
viewed as a system consisting of many objects - coal particles, burner, heat exchange tubes, boiler 
drum. etc. Typically the heat is released by the burning of coal particles in a coal-fired power 
station. The typical size of coal particles is - 50//m. The process of heat release is complex. First, 
the particle goes through a process of volatilization, during which hydrocarbon gases are released 
and the coal is transformed into char. Depending on particle size, the time scale (T,) for this is 
typically - 20 ms. Then the char burns to ash and typically the time scale ( T 2 ) for this is - 200 ms. 
Since the coal particles and ash are transported in a stream of hot gases, we have a third 
time scale (7’ 3 ) the duration for which the particle (initially as coal, then as char and finally ash) 
stays in the boiler. This duration is given by the length of path that the particle has to traverse in 
the boiler divided by the mean velocity of the hot gases. Proper design requires that T 3 > T 2 + 7",, 
or else heat is wasted in the chimney. 


Example 16 (Magnetohydrodynamic System) 

The operation of a magnetohydrodynamic (MHD) system can be described by the motion of 
charged particles (ions and electrons) orbiting the lines of a magnetic field. There are at least five 
different length scales, such as the range of intcrparticle potential r 0 ; the mean free path A; the two 
radii of gyration r u - u r ( eB/M ,) and r Li - u t (eB/M t ) where u e [wj and M, [MJ are the mean 
thermal speed and mass of electron [ion] and the macroscopic length L (the size of the system); e is 
the charge of electron and B is the strength of magnetic field. On dividing these lengths by u e and u, 
we arrive at a set of time scales. Thus for the electron (and similarly for the ion) we have the 
hierarchy of time scales as shown in Table 3.1. The two other gyro-times M e /eB and M JeB can lie 
anywhere on the lime scales. Table 3.1 also shows the relevant theories needed to describe the 
interactions in the system. 


5 Deterministic vs Stochastic 

A system is said to be deterministic if the values assumed by the variables (for a 
static system) or the changes to the variable (for a dynamic system) are 
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predictable with certainty. If not, then uncertainty is a significant feature of the 
system and the values assumed by the variables, or the changes in the variable, 
are random and unpredictable. Such systems are called either probabilistic or 

In real life there is always uncertainty. If the uncertainty is insignificant then 
it can be ignored and the system can be treated as a deterministic systerm This 
is a process of simplification or idealization. If the uncertainty is significant 
then it cannot be ignored and must be taken into account in the system 
characterization. 


Example 17 (Component Reliability: Case Study I) 


Example IS (Rocket Launch: Case Study D) 

The variables of the system are the position and velocity of the rocket. Since the laws of classical 
dvnamks describe the motion fairly accurately, the changes in position and velocity can be 
predicted SbVhf of certainly. Hence, in .his case we can view the system as being 

deterministic. 

6 System Characterization of Case Studies A-L* 

In this section we carry out the system characterization of the real-world 
problems defined in Case Studies A-L in Section 2 of Chapter 2. These system 
characterizations will be used later in Chapters 5 and 7 to build mathematical 
models. 

6.1 Case Study A [Cont.] (Weight Lifting) 

The weight (L) lifted by a weight lifter in either jerk or snatch motion depends 
to a large extent on his body weight ( W) and to a small extent on various other 
factors - some internal to the weight lifter and others external. Some of the 
internal factors are the physical and mental conditions of the lifter, while some 
of the external ones are the physical conditions of the location where the 
competition is taking place and the spectator response. Often these external 
factors play a very minor role and hence can be ignored. As a consequence the 
problem can be described in terms of a closed system with two objects - the 

weight lifter and the weight. . 

Since the interest is in the weight lifted in jerk and snatch motions, the tim 
history of the motion is of no significance; hence the system can be viewed as a 
static system. Also, since a weight lifter can repeat his best lifts with a fair 
degree of predictability, we can treat the system as a deterministic system. 


Sec footnote on page 10. 
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The simplest description involves only two variables - the body weight of 
the lifter (W) and the weight lifted (L). The weight lifted is different for snatch 
and jerk motions as in the snatch the weight is pulled from the floor to a 
locked overhead position in a single move; whereas for the jerk two moves are 
allowed - the first to the chest and the second to the overhead position. A 
slightly more detailed description is one where the body weight is divided into 
two components muscular and non-muscular. 

The goal for the study is to find a handicapping rule to decide the winner 
when body weights of competitors vary significantly. We do this by finding a 
relationship between L and W using the data given in Table 2.2 and then 
obtaining the handicapped weight lifted (L) by scaling L in terms of W using 
the relationship. The winner can then be decided on the basis of the highest 
score for L. 

6.2 Case Study B [Cont.] (Alloy Selection) 

Alpha Inc. is interested in producing titanium alloy with the least possible 
thermal coefficient. As there is more than one method of producing titanium 
alloy and more than one source for the ore needed, the problem facing Alpha 
Inc. is first to establish whether there are significant differences in the thermal 
coefficient of alloys produced by different combinations of ore and method of 
production. If the differences are not significant the decision is simple, since it 
does not matter from which mine the ore is obtained or the method of 
manufacture. When the differences are significant, the decision is again simple 
if, for each combination of ore and method, the variability in the thermal 
coefficient is small and insignificant. In this case the particular combination of 
ore and method which yields the smallest value for the thermal coefficient is 
the one that Alpha Inc. should select. 

We will consider here the more complicated case in which the thermal 
coefficient varies significantly for all combinations of ore and method of 
manufacture. This could be due to many factors - the variability in the ore due 
to geological conditions and variability in the method of manufacture due to 
uncontrollable factors. These variations are unpredictable; hence the vari¬ 
ations in the thermal coefficient of the alloys are also unpredictable. 

In this case the problem can be described in terms of a closed system with 
four significant variables; (i) the mine from which the ore is obtained (MA), 
(ii) the method used in manufacture (ME), (iii) the thermal coefficient of the 
alloy ( TC ) and unknown influences (A). Other variables can be ignored as 
Alpha Inc. has no control over them and their effect on TC is accounted for 
through X. 

The four different mines can be labelled as MA { (/= 1,4) and the two 
methods as MEj(j = 1,2). As a result we have eight different alloys - obtained 
by different combinations of manufacturer and method. These we label as T C y 
(i= 1,4; j = 1,2) with TC tJ representing the thermal coefficient of the alloy 
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Figure 3.6 System characterization for Case Study B. 


produced by using ore from MA, and method ME s . TC „ is a random variable 
due to the influence of X. 

Following the standard terminology of the des.gn of experiment. we shal 
call "ore from mine” and "method” factors which affect the thermal coefficient 

° f Since Alpha Inc. is interested only in the thermal coefficient of the alloy, time 
plays no role; hence the system can be viewed as a static system. e 
relationship between the variables is as shown in Figure 3 „ 6 ’. ,n< J lcat '" g 
the method and the manufacturer can affect the thermal coefficient of the alloy. 

The goal of the study is first to establish whether ore from different mines 
and method of manufacture affect the thermal coefficient significantly, and 
second to determine the right decision that Alpha Inc. must make with regard 
to the ore and the method of manufacture. 

6.3 Case Study C [Cont.] ( Optimal Production ) 

Beta Inc. manufactures a range of products from milk. Since the aim of Beta 
Inc. is to maximize profits by varying product distribution, an important 
variable is the time interval over which the company wishes to ™ ax ! m, “ £ 
profits. Let T denote this time interval. If T is sufficiently large, then i 
approximated as being infinite, and the aim is then to maximize the profits pe 
unit time. The reason for treating the time interval as being mfirnt11 tha This 
allows the use of asymptotic methods to obtain the optimal solution. If T is not 
sufficiently large, then this approximation is not valid. In either case th 
variables of the system characterization, when the real world associated with 
the problem is viewed as a system, change with time^Thus, time: is an 
important element and the system must be treated as a dynamic syste _ 

The time element can be treated as either discrete or continuous Cont¬ 
inuous time characterization is too detailed and hence not appropriate.^ The 
unit of discretization could be either a day, week or month. There could be 


SYSTEM CHARACTERIZATION 


31 


significant day-to-day variations in the delivery of certain products, payments 
received and so on. The delivery variations are made possible by shelf-life of 
many days for most products. Clearly a day unit would provide a more 
detailed description than is necessary for profit optimization. Most of these 
fluctuations, irrelevant to the present study, would disappear if a week was 
chosen as the unit of discretization. A month would be too long and 
information about the factory performance would be lost with such a coarse 
unit of discretization. For these reasons a week is the most appropriate time 
interval of discretization for the problem. 

If we assume that there is no carry-over from week to week, then each week 
can be treated independently. Hence the aim of maximizing profits over the 
interval T is equivalent to maximizing profit for each week separately. As a 
result we can ignore the dynamic nature of the system and treat the operation 
over each week using a static characterization, thus making the system 
characterization a lot simpler. 

We use the following notation to represent aggregate weekly values. 

X: milk delivered to factory (in litres) 
y l ; milk used for producing butter (in litres) 

Z,: amount of butter produced (in kilograms) 

S,: amount of butter sold (in kilograms) 

D,: demand for butter (in kilograms). 

Similarly, Y 2 , Z 2 , S 2 and D 2 correspond to yoghurt; Y 3 , Z 3 , S 3 and D 3 
correspond to cream. 

y 4 : milk used for sale as processed milk (in litres) 

D 4 : demand for processed milk (in litres). 

The system characterization for the problem is as shown in Figure 3.7. Note 
that the system is an open one with five input variables (X and D„i =1,4) and 
four output variables (S„ i =1,4). The remaining variables are internal to the 
system. The parameters of the system are Cj, s 2 and Tj (j= 1,3) with c, 
representing the cost of producing one unit of butter; s, representing the selling 
price of one unit of butter; T, representing the number of units of milk required 
to produce one unit of butter and so on. c 0 represents the cost of producing one 
unit of milk before processing. 

In the system characterization shown in Figure 3.8a for the case of butter 
production we have the sale price for the products fixed and not affected by the 
amount produced. However, the amount sold is affected by the sale price. 
Since it makes no sense to produce more than the demand, as there is no carry¬ 
over from week to week, we have constraints of the form Z, < D, (i = 1,4). In 
addition we have various other constraints e.g. T, + Y 2 + T 3 + V 4 < X and 
so on. 
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c 4 


Figure 3.7 System characterization for Case Study C. 

On the other hand, if the sale price is influenced by the quantity produced, 
then prices need to be treated as variables with a causal relationship as.shown 
in Figure 3.8b. Similar relationships hold for other processed products 

If we assume that everything produced is sold, and that the uncertainty in 
demand is small, then we can treat the system as deterministic. In this case th 
problem is described in terms of an open deterministic static system. 

P The goal for the study is to obtain the optimal W = 1.4), for a given X, so as 
to maximize the profit which is given by the difference between the revenue 
generated by sales and the costs incurred. 

Comment. If carry-over of products from one week .0 another is allowed, 
then we cannot treat each week separately. In this case the variables are 

seauences ie Z,(i), i = 1,2_represents the amount of butter produced in 

week /. Because of carry-over we need a set of additional ; ariab ' e ^ 
the level of stocks of different processed products at the end of each week. 
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Figure 3.8a Partial system characterization for Case Study C (selling price fixed). 



Figure 3.8b Partial system characterization for Case Study C (selling price not fixed). 



Z,(t) D,(i) 


Figure 3.9 Partial system characterization for Case Study C (carry-over from week to week). 


Thus, for the case of butter we would have an additional variable ST x (i) to 
represent the stock at the end of week /. The relationship between the variables 
associated with butter are as shown in Figure 3.9. In this case the system 
characterization is more complex than the one discussed earlier. 
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6.4 Case Study D [Cont.] (Rocket Launch) 

The motion of the rocket is influenced by the thrust generated by it and the 
forces'actingon H due ,0 Us in.erac.ion wi.h ihe earth and Cher ceiesual 

'“rosier, let us assume that the forces due to interaciion with celestial bodies 
(Cher than the earth, are insignincant. This is a process of 
he validity of this assumption needs to be established at a later stage, wun 
thU assumption a descripL of the problem can bcdone in terms of a closed 
svstem wTth two objects the rocket and the earth, each tn.cractmg w, h the 
other The variables characterizing the rocket are its position an ve tty 
!hlh resKcu" some fixed pent on the earth and the thrust generated by 

Fit) represent the thrust generated by the rocket in its upwar 

Since the motion of the rocket is governed by the laws of dynamics, he 
acceleration of the rocket x"(r), the time derivative of x (r), is influenced by 
following three forces. 

(i) F(t): thrust generated by the rocket, 

(ii) G: gravitational force of attraction, 

(iii) /: frictional retarding force. 

The direction in which these forces ac. can differ depending on whether the 
rocket is going up or down, as shown in Figure 3.10. 




G 


(Downward motion) 

(Upward motion; 

Figure 3.10 System characterization for Case Study D. 
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6 



Figurk 3.11 System characterization for Case Study D. 


As a result the system characterization involves four variables. The 
relationship between the variables is given by the laws of dynamics. The 
gravitational force varies with x, whereas the retarding force depends on both 
,v and x'. The relationship is as shown in Figure 3.11. The parameters of the 
system are the mass of the rocket and the constants of the functions which 
relate G to x and / to x and x'. 

Note that if the forces due to interaction with other celestial bodies are 
significant, contrary to our assumption here, they can be included as 
additional variables acting on the system from the outside environment. In 
this case the system is no longer closed but is an open one. 

In general, the thrust F(t) is of constant magnitude F m and acting for a 
relatively short period in relation to the time taken to reach the maximum 
altitude. In this case we can approximate the thrust as an impulse. 

Since the position changes continuously with time, the time variable is best 
treated as being continuous. As the motion is predictable, the problem can be 
described in terms of a closed, dynamic and deterministic system. The goal of 
the study is to obtain the relationship between F m and x m the maximum 
altitude reached. From this, one can calculate the thrust needed to ensure that 
the rocket reaches the desired altitude. 

6.5 Case Study E [Cont.] ( World Population ) 

The human population is continuously changing with time, increasing with 
each birth and decreasing with each death. With lime, people age, and as a 
consequence the age structure of the population changes. In describing 
population changes one can treat the time variable as either continuous or 
discrete. We shall first deal with the case where time is treated as a continuous 
variable. 

The simplest characterization is in terms of a closed dynamic system with 
human beings as objects, and characterized by one single variable, N(t), the 
total population at lime t. and two parameters birth rate h and death rate il. 
The rate of change of population, N‘(t). is predictable and affected by N(t), h 
and d as shown in Figure 3.12. 
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Figurl 3.12 System characterization for Case Study E. 

The above characterization is obviously not adequate to predict ‘he 
population in 100 years time, since it ignores many relevant features o 
population change. One such is the constraint that the earth can support only 
a finite number of people. This can be included in the system characterization 
by including a new plme,er. L, .he upper lin.il .0 .he popu ahon. As MO 
reaches the level Lthe rate of change in population goes to zero. 

A slightly more detailed characterization is one where the population 
divided in,o diiTeren, groups, based on see. In .hisca* we Have ,wo vanabte. 
N (i)and NAt), representing the total male and female numbers 

two groups can have different birth and death rates. 

A more refined characterization is one where the grouping is based on age 
instead of sex. The age structure can be characterized in two different ways. In 
the first and simpler characterization, we have three groups. 

group 1: young and juveniles 
group 2: mature and child-bearing 
group 3: old. 

In this case the system is characterized by three variables - NADU =;3) with 
Nft) representing the number of people in group i at time t. The system 



Figure 3.13 System characterization for Case Study E (age structure included). 
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Figure 3.14 Birth and death rates as functions of age x. 


characterization is as shown in Figure 3.13. Note that we have different death 
rates for each group, and there is transition from group 1 to group 2 and from 
group 2 to group 3, and additions to group 1 with births. In the second, we 
characterize the age of each human continuously. In this case we have one 
single variable N(t y x) with N(t y x) Sx representing the number of people in the 
age group .v to x + dx at time t. In this case the birth rate fr(x) and death rate 
J(x) need to be specified as continuous functions of age. Typically the shapes of 
these functions are as shown in Figure 3.14. Death can occur at any stage, and 
this is reflected in the continuous distribution of death rate with age. Death 
rates are higher than average at very early ages (infant mortality) and at old 
age. In contrast, the birth rate is non-zero for a limited age range because of the 
limited fertility period of females and male juveniles and old males being 
incapable of reproduction. 

Yet another system characterization is one where we have only a single 
variable, N(t), to represent the total population at time f, but includes the 
gestation period. In this case we have an extra new parameter r to represent 
this period, and the rate of change in N(t) at time l depends on N(t — x). 

When we treat the time variable as discrete, the system is characterized by 
variables representing population numbers at discrete time instants. Changes 
to these variables occur due to births and deaths over the interval (A) between 
two successive discrete time instants. Thus, instead of birth and death rates (as 
in continuous time characterization) we have births and deaths over different 
intervals. In the simplest description we have, for each interval, three 
variables - N(t), B(t ) and D(t), with t representing the interval number; N(t) the 
population at the end of the interval; B(f) and D(t) representing the births and 
deaths in the interval. The relationship between the variables is as shown in 
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b(t) 


(a) Gestation period ignored 



b(t) 


(b) Gestation period included 

Figure 3.15 System characterization for Case Study E. 

Figure 3 15, where the first characterization ignores gestation and the second 
includes it (It is assumed that the interval length A and gestat.on period r are 
constant with t = *A.) We can obtain more detailed characterizations of the 
system using similar approaches to those above when t.me was taken as a 
continuous variable. 

Comment In the above system characterization we have assumed that the 
changes to the variables are predictable. This assumption .s justified only when 
the population numbers ate lat g e and the in.et.al otstud, ,s smallWhen one 
or both of these assumptions do not hold, then uncertainty becomes 
important feature of the system characterization. 

We have also assumed that the changes occur gradually in the ca “ 
continuous time characterization. This implies that it is ' nad ^ ua ^‘° h f v S e C ^ 
catastrophic changes that can occur due to large-scale 

assumed that in the simplest characterization the parameters h d and Lam 
constants This is valid if there are no external factors affecting them. If 
assume^That L depends on factors such as technology level. </ depends on 
medical advances, etc., then we need to incorporate these ,nto ‘^ syS 
characterization. This can be done by regarding these a^ y , te m 

outside the system (i.e. from the system environment) and treating the sy 
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as an open one. This will result in more variables and a more refined 
description. 

6.6 Case Study F [Cont.] ( River Pollution) 

In order to solve the problem of how to keep concentration levels of pollutants 
below prescribed levels. Gamma Inc. needs to obtain changes to concentration 
levels in the river over time under different dumping policies for chemical 
waste. A policy for dumping is defined by the time history of dumping. 

We shall assume that the dumping is carried out at discrete time instants. 
The effect of dumping is to increase the concentration level suddenly, at the 
moment it occurs. With the passage of time the chemical waste diffuses into the 
water and also gets transported downstream. Depending on the waste, it can 
cither go through a process of transformation so that it ceases to be an 
undesirable waste (as with biodegradable waste) or retain its chemical 
structure and continue to be an undesirable chemical waste. 

In the systems approach the river can be treated as a single object with the 
waste dumped being an input variable (from the system environment) and the 
water discharged, either to an ocean or a lake, being an output variable (to the 
system environment). In the simplest characterization the river can be viewed 
as a one-dimensional channel, of constant cross-section A, with x = 0 
corresponding to the location where the waste is dumped into the river. Let L, 
and L 2 represent the length of the river upstream and downstream from the 
point where the waste is dumped. As a result, we have - L, < x < L 2 with 
distances measured downstream being positive and those measured upstream 
being negative. The variable of interest is given by T(t,x) - the concentration 
level of chemical waste at location x from the point of dumping at time f. The 
variable changes continuously with t and x. Another variable of interest is the 
velocity of flow - the simplest characterization for this is when it can be treated 
as constant of value U. 

Since the river receives clean water from an external source the con¬ 
centration level is zero at x = — L t , hence T( — L x , I) = 0. If we assume that the 
lake or ocean to which the river discharges is very big, then the concentration 
level there is also negligible. Hence T(L 2 , f) = 0. The diffusion and transport¬ 
ation of the waste is predictable and governed by the laws of fluid mechanics. 

As a consequence the problem can be described in terms of an open, 
deterministic dynamic system. The aim is to obtain time histories of 
concentration levels under different dumping policies so as to select the policy 
which disposes maximum waste without exceeding acceptable limits. 

In the above characterization we have viewed the river as a one-dimensional 
channel with constant cross-section and ignored the lateral diffusion of the 
chemical waste across the width. Since the waste is dumped at a point, rather 
than along the whole cross-section, the above characterization is an oversim¬ 
plification. A more refined characterization includes such spatial variations. 











40 


MATHEMATICAL MODELLING 



coordinates are given by (x, y ,-) (see f igu ’ < , x) sjnce 

X. the" velocity of Row, U(x), is a 

fU There are many different time scales in this system characterization We 
mention a few: (i) time for chemical reaction; (ii) time for diffusion; and (...) time 
for transportation (~ LJU). 

Comment. If the river is discharging into an ocean and L 2 is ^alUher iwe 

changes considerably, then this must also be taken into account. 

6.7 Case Study G [Cont.] ( Weather Changes) 

The economic viability of the solar heating system for a swimming pool 
depends on the number of days in a year it is sunny (i.e. sunshine or more 
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8 hours) and partly sunny (i.e. sunshine between 4 and 8 hours). On these days 
there are savings to be made in the consumption of liquid fuel if a solar heating 
system is installed. The installation of such a system will be economically 
justifiable if these savings exceed the total of the operating and capital 
repayment costs. Thus the most critical variable in the economic analysis is the 
number of days in a year w hen it is either sunny or partly sunny. Hence we 
need to obtain estimates for the number of sunny and partly sunny days in a 
year. 

Towards this end, we note that changes of weather over time are 
unpredictable and influenced by many factors. The time and length scales 
associated with these factors can vary considerably - ranging from a few 
minutes to days and from a few kilometres to thousands of them. Also, not all 
factors are properly understood. This suggests that a “black-box" represent¬ 
ation may be appropriate with the description given in terms of a closed 
dynamic system with a high degree of uncertainty. Since our interest is in the 
number of hours of sunshine in each day, the time variable is best treated as 
being discrete with the interval of discretization being a day (from midnight to 
midnight). Let N(t) represent the weather on day f, viz. sunny, partly sunny or 
cloudy. As a result we let N(t) assume the following three values: 

N(t) = 1 if day t is sunny, 

= 2 if day t is partly sunny, 

= 3 if it is cloudy. 

NU) changes randomly with time. Recognizing that there is a certain 
persistence about weather patterns, N(t) at time t could be characterized as 
depending in a random manner on the value at t —j, i.e. N(t —j), 1 <j <J. 
Note that this ensures that the principle of causality is not violated. J is a 
parameter reflecting the degree of dependence on the past. J being large 
implies that the influence of factors with large time scales is included. 

We shall assume that J = 1 for the simplest characterization. In this case we 
have the system parameters given by the probabilities characterizing the 
random transitions. 

Comment. In the above characterization we assumed that the probabilities 
of transition do not change with time, and hence are treated as parameters. 
This assumption is often not justified. A slightly more refined characterization 
is to divide the year into different seasons summer, winter, etc. - and use 
different probabilities of transition for each season. 

6.8 Case Study H [Cont.] (Thermal Power Station) 

The thermal power station produces electricity by burning coal. The process is 
not direct. The chemical energy in the coal is released as heat by combustion 










42 


mathematical modelling 



^ , kinetic Electricol 

Chemical Thermo, Kme he energy 

energy energy 


FiglRF. 3.17 Energy transformation in a thermal power station. 

and the heat in turn is used to convert water into steam. These two processes 
".he boiler. The s.ean, produced in .he boiler “ « 
where the thermal energy in the steam is converted into rotational kinetic 
tZ ThSy is uSd ,o drive .he ro.o, of .he genera.o, and a ro.ahng 
magnetic field is sc. up by exciting the rotor. When 

conductors of the stator, energy transfer occurs resulting in the kinetic energy 
being transformed into electrical energy. This sequence is shown schem- 

dl The y voUage andfrequency of the electrical output must always be held close 
to some^re^^cified values. As the electrical load (i.e. demand) on the power 
station varies^he input to the boiler, turbine and generator must be varied so 
hat the output of the generator matches the load. This is usually done by 
means of various regulators; the problem is to design these regulators. In order 
S understand how changes to input affect the output for 
tSe boiler the turbine and the generator. We shall confine ourselves to 

btU FoMhe h pSl^2s1n S mind, the thermal power station can be viewed as a 
closed deterministic dynamic system consisting of three interacting su 
systems with output from the boiler (first subsystem) being the input to the 
turbine (second subsystem) and the output of turbine being input to the 

^"^nThcsimplesfcharaclerization each subsystem can be viewed as ai “black 
box” With the input and output related by a dynamic relationship. The 
different input/output variables for the three subsystems are as shown 



Figckl 3.18 System characterization for Case Study H. 
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Figure 3.18. The goal of the study is to find the dynamic relationships linking 
the input to the output for each of the three subsystems. 

Comment. Instead of viewing the subsystems as “black boxes” one can 
characterize each of them in terms of the physical processes taking place. 
This would lead to a more detailed description involving a large number of 
variables. 

6,9 Case Study I [Cont.] (Component Reliability) 

Since the control element is essential for the operation of the machine, the 
machine becomes inoperable whenever the element fails, and is made operable 
by replacing the failed element with a new working one. The solution to 
specific problems regarding the operation of the machine can be obtained by 
focusing attention on the control element. 

A description of the control element, in the context of the problem, can be 
done in terms of an open system with a single object and a system environment 
as shown in Figure 3.19. The simplest characterization, using the “black box” 
approach, involves a single variable. A'(f) with 

X(r) = I if the element is working at time t , 

= 0 if the element is not working at time t. 

Thus. X(f) changes from I to 0 whenever a failure occurs, and from 0 to 1 
whenever a failed element is replaced by a new one. 

Let Tj, i = 1.2,... represent the time to failure (or age at failure) for the ith 
element, r, is a random variable. If all spares are identical and failures are 
independent, then the various times to failure are a sequence of independent 
and identically distributed random variables. Let p represent the time needed 



Fiui Kr 3.19 System characterization for Case Study I. 
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x(t) 



o(t) 






N(t) 


Figure 3.20 Time histories of X«). a(0 and N(t) for Case Study I. 

to replace a failed element by a new one. We assume that there is very little 
variability assoc.ated with this, and hence it can be treated “ ■ 

If we assume that the state is continuously monitored, then failure 
detected at the instant it occurs. We assume that replacement commences 
immediately after an element fails. Then X(f) changes randomly over time. Le 
N(t) represent the number of replacements that have taken place in the interval 
[0, f]* and a(f) be the age of the element at time f if X(f) - 1. A typical histo y 
of X(f) N(t) and u(f) is as shown in Figure 3.20. 

If p is small in relation to the mean life of the control element, then we can 
approximate p as being zero. This is a simplification which is valid n many 
instances In this case the instances of failure (as well as replacement) can be 
regarded as random points along time, with the time interval between 
successive points corresponding to failure times of different spares. 

The system parameters are the parameters of the distribution function 
characterizing the time to failure; Tthe total time interval of interest and NT 

the total number of spares available. . hox” 

An alternative characterization (as opposed to the previous ^ 
characterization) is one which describes the mechan.sm of fa.lure of the elemenL 

Suppose that failure occurs due to crack propagation. In this case the variable 
characterizing the sys.etr, arc given by X0, and TOW)- - ***££ 

d y rd^iaXin2sa":S ^alen't oFa Patled eienren, is 
a range within a and b. 
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Age (t‘) » 

(Gradual growth) 



Failure 

1 / 


Age (t*) —• 

(Sudden growth) 

Figure 3.21 Crack propagation for Case Study I. 

completed and the machine made operational. Failure occurs when Y(t) 
reaches some critical value L. The crack propagates randomly with time and 
the changes over time can be either as gradual or sudden as indicated in Figure 
3.21. The latter characterization is appropriate if the growth in crack occurs 
due to shocks occurring randomly in time and the effect of each shock on crack 
growth is random. 

The goal of this study is to obtain the statistical characterization of: (i) the 
instant at which the last spare fails and (ii) the number of spares, N (T), needed 
to keep the system operational for a period T. 


6.10 Case Study J [Cont.] (Speed of Aeroplanes) 

The problem is to forecast the maximum speed of aircraft in the year 2000. 
Improvements to the maximum speed of aircraft occur with advances in 
technology. The processes responsible for these changes are very complicated 
and involve many factors. 

We can describe the changes to maximum speed of aeroplanes in terms of a 
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Figure 3.22 System characterization for Case Study J. 


closed system. The simplest characterization, based on a ‘‘black-box 
approach.' involves a single variable, X(r), representing the maximum speed 
achieved at time f and increasing with time in a deterministic manner. One can 
characterize this increase over time in two different ways. 

(i) XU) changes according to some pre-specified function DM) with 0 being a 

(ii) X'(f) 1 = dX(f)/df} being influenced by X(f) in some pre-specified manner 
as shown in Figum 3.22 with L, a parameter, representing the upper limit 
to the maximum speed. 

In real life, technological changes occur in an unpredictable manner and 
hence the assumption that changes to the maximum speed are deterministic is 
obviously inappropriate. A way of incorporating the uncertainty is to view the 
speed achieved at time f being of the form 

X(t) = D(t,0)+V(t) 

where D(t,0) is a deterministic function and V(t) is a stochastic process 

^An'alternative characterization is as follows. Technological changes do 
not occur continuously in time rather, they occur at discrete time instam 
due to technological breakthroughs. The time instants of breakthrough^ as 
well as the efTcct of each breakthrough on the speed, are highly variable and 
hence need to be characterized in a stochastic framework. Let t, denote 
time instant at which the ith breakthrough occurs and let Y, ^present the 
improvement in the maximum speed due to the breakthrough. ( ) 
,31“ sum of break,hroughs and a typical plo, of TO « shown m 
Fionre 3 23 Let T i = 1,2... and with r o = 0. Then. t„ ^ 

represents the time'inte'rval between two successive breakthroughs and is a 
sequence of random variables. Let N(t) represent the number of break th oughs 
in the interval [r 0 .t] and this is also a random variable. We assume that Y, 
is a sequence of independent and identically distributed random> vanabkj 
with a distribution function F(.) and that t, is also a sequence o* nd «P«^ 
and identically distributed random variables with a distribution functio 
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Figure 3.23 Typical plot of ,Y(f) for Case Study J. 


Comment. In the simpler characterization we have assumed L to be a 
constant. The value of L depends on the type of technology. In the case of 
aeroplanes we have three distinctly different technologies - propeller-driven 
engines, jet engines and rockets. Thus a more refined characterization is one 
where we have three different limits - L { for the propeller; L 2 for the jet and 
L 5 for the rocket with L x <L 2 <L y The older technology gets replaced 
when a major technological advance occurs. These changes need to be 
characterized. Within a given technology type, changes occur as described 
earlier. A similar refinement can be done for the alternative stochastic 
characterization. This can be done by separating breakthroughs into two 
types-major and minor. Major technological breakthroughs result in a 
change in the technology type whereas minor breakthroughs result only in 
speed improvements within the current technology. 

6.11 Case Study K [Cont.] (Supermarket Operation) 

Here we are interested in determining the optimum number of checkout 
counters in a supermarket to maximize the expected profit. Queue lengths at 
checkout counters in a supermarket vary randomly over time. If there are 
insufficient checkout counters, queue lengths tend to be excessively long and 
this affects customer patronage. On the other hand, if there are too many 
counters the operating costs are increased. In either case the profitability of the 
supermarket is affected. Thus a suitable trade-off must be made between the 
number of counters installed and customer patronage to maximize profits. In 
other words, the problem is solved once a relationship between some measure 
of queue (e.g. average queue length or average time period for which the 
customer has to wait before being served) and the number of checkout 
counters is obtained. 
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Towards this end the problem can be described as an open dynamic system. 

NU) bCing Ser ' ed bl The 

S is random. In rhc simples., ctarncttnzrt.or, w« 

- that the inter-arrival times are independent and identically distribute 

^^S.SnO.)and, h a.i ^service 

of checkout counters. 

Comment A more refined characterization is one where the checkout 
C omtnt • . «m j “•(•..ct” Customers with more than L 

~ a\\S”b e comcs 8 sho,,e, .here is a possibility for a 
CU r„Thc r rpt7chTrac«riza,ion we bare assumed that the ^..arrival 

derailed^ description would take into account factors such as spccal sales, 
influence of advertising, etc. 


Customers 
to be 
served 



Customers 

served 


I Kii Rt 3.24 System characterization for C asc Study K. 


SYSTEM CHARACTERIZATION 


49 


6.12 Case Study L [Cont.] ( Demand for Soft Drinks) 

The demand for soft drinks varies with time and is influenced by many 
factors - price, advertising, competition, population changes, seasonal vari¬ 
ations, taste, conditioning, etc. The effect of some these factors on the demand 
is predictable, but for most it is uncertain. 

Since the demand changes dynamically, the time variable is important and 
can be treated as being either continuous or discrete. Discrete time character¬ 
ization is more appropriate, for then we can characterize the problem in terms 
of demand over different time intervals or periods. A month is a convenient 
interval to use for characterization, since seasonal variations can then be easily 
included in the characterization. A weekly description is too detailed whilst a 
yearly description is too coarse. 

The simplest characterization can be done in terms of a closed dynamic 
system involving a single variable, *(*), representing the demand for the soft 
drink in period /, and being composed of two terms: 

X(/) = D(f)+ V(t) 

where D(t) is the deterministic part representing the effect of factors whose 
influence is predictable (c.g. growth in population, seasonal variations) and 
V(t) the stochastic part representing the effect of factors whose influence is 
uncertain. Because of the dynamic nature of change, D(t) is a function of its past 
values and, similarly, V(t) is a function of its past values. As a result we can view 
X(f) as a function of its past values and of D{t) and V(t) and possibly their past 
values. Since there is yet no theory to characterize the relationships, they must 
be obtained empirically using the historical data given in Table 2.7. 

The goal of the study is to obtain this relationship, and once this is done the 
demand in the future can be forecast. 

Comment. In the above characterization the effect of the different factors is 
either ignored or included in an implicit manner. An alternative characteriz¬ 
ation is one where the effects of these factors are described explicitly. In this 
case the problem can be described in terms of an open system with the relevant 
factors as inputs from the environment. The characterization will be 
more involved c.g. advertising can affect the behaviour of a competitor, 
which in turn alters the effectiveness of the advertising and so on. This 
illustrates the statement made in an earlier section that identifying causal 
relationships between variables in social/economic systems is in general a 
difficult task. 

Notes and References 

Over the past few decades a large number of books have appeared on systems 
thinking and the systems approach. Items [1] [8] are a small sample; a more 
philosophical approach can be found in [9]-[l 1], 
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subsystems, etc.) discussed in discussions on these issues see [14]- 

appeared in various phrlosoph,cal journals 

"LS <o, characterizing ekcuicaK mechanical -J— 

systems. The details of this type ° re P^ h { 0 f representation 

r ^ tm an. 

C2 ??n nil are a small sample of books which illustrate the growing use of 
the systems approacb'di understanding and in solving problems from a diverse 

range of disciplines. 
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CHAPTER 4 


Mathematical Modelling 


1 Introduction 

in the previous chapter we showedinis 
problem can be treated as th y y We commen cc with a brief 
chapter we d.scuss the modelling of such a system.^ ^ ^ which we use the 

depth. „u an .pr i« to develop the process of mathematical 

^ ■*<**• -»» d ' ,^eren, topics 

required for successful modelling. 

2 A General Introduction to Models* 

2.1 Model-Varying Meanings 

One of the major deficiencies of most natural languages is that 

which il is used. In the case of the word model , an exarmnat ^ ^ 

* Sections 2-5 »n be skipped when te.dinp to, the «... » «« *•» *~ d “ 

philosophical) issues of models and modelling. 
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(i) Mathematical logic. “A possible realization in which all valid sentences 
of a theory T are satisfied is called a model of T" (Tarski [20]). 

(ii) Physics. “. -. the proposition comprising a model of an X are the same as 
ones constituting what may be called a theory of an X (The Bohr Theory of 
the Hydrogen atom)” (Achinstein [21]). 

Although the word model used here has a precise and well-defined meaning in 
each branch of learning, the meaning is different in each usage. Thus, it is 
essential to state clearly the usage intended in a particular context so as to 
avoid any confusion. 


2.2 "Model" as used in this Book 

Our usage of the word “model" in this book is given by the following 
definition - “A model is a representation of a system.” Furthermore, a 
“model" is said to be an "adequate model” if it is adequate for the goal in the 
mind of the modeller. 

The key terms in the above definition are: (i) system, (ii) representation, (iii) 
goal, and (iv) adequate. We need to define these terms so as to give a precise 
meaning to our usage of the word "model”. We have discussed “system” and 
“goal” in Chapter 3. In this section we explain the remaining two terms. 

2.3 Representation 

A representation of a system, in general, contains less information than the 
system it represents. This point is important, as the representation should 
contain only those features or characteristics of the system which the model 
builder feels are relevant and significant for the goal in mind. The represent¬ 
ation of a system can be either a physical device or an abstract formulation. A 
few examples of the different types of representations are: 

(i) Scaled Representation 

Here the physical object is represented by a scaled-down version in which 
some scaling relationship is obeyed. For example, a geometrically similar 
scaled model of a building may be used in a wind tunnel with the goal of 
predicting wind loads on the real building. 

(ii) Pictorial Representation 

Simple examples include a globe representing the earth, a map representing 
topographic features of the land surface or a photograph representing some 
physical object. 
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(iii) Verbal Representation 

This consisis of a verbal descrip,ion of a s,sto 

—- 

(iv) Figurative/Schematic Representation 

g^P 

interrelationships, are represented by connected sym 

(v) Symbolic Representation 

Without this correspondence the symbolic representa i 

2.4 Adequacy 

and wc illustrate it again by the following example. 


The colour of a car or .he type of upholstery is not significanl for a study of ihe ride characierisrkJ 

1SSHSSSSS2 

representation. 

shall defer discussion of this aspect to a later p 

^ Model Classification 

Broadly speaking, models ean be classified info .wo main types or calegor.es: 
“S or physical models: ,2, non-malenal o, formal models. 
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As the name suggests, material models have a physical existence with a direct 
relation to the real-world problem. For example, a scaled model of a system is a 
replica of that system, and is related to the original system through a scale 
transformation in terms of features that are important. In many cases it is not 
possible to build a scaled model; for example we cannot build a scaled model of 
the planetary system and of the rocket to see if the rocket will perform 
satisfactorily. 

In contrast, non-material or formal models are abstract representations. 
These may take a physical form, such as symbols written on a sheet of paper, 
but have no direct similarity with the real-world problem. For example, a 
descriptive model of a system is a representation of the system in some natural 
language. Wc have already encountered one such representation - termed 
“system characterization” - in the previous chapter. A system characteriz¬ 
ation is a descriptive model of the system associated with the problem, and 
plays a very important role in building a mathematical model. 

Later in this chapter we will briefly examine types of models which have an 
important relation to mathematical models. 

4 Mathematical Model 

A mathematical model of a system is a symbolic representation involving an 
abstract mathematical formulation. A mathematical formulation is composed 
of symbols, and makes no sense outside mathematics. The symbols have 
precise mathematical meaning and the manipulation of symbols is dictated by 
the rules of logic and mathematics. A mathematical formulation is not a model 
by itself. It is only by relating the mathematical formulation to a system 
characterization (i.e. variables and relationships, as discussed in the previous 
chapter) that the formulation becomes a mathematical model. An adequate 
mathematical model is a mathematical model which is adequate for the purpose 
in the mind of the builder. 


Example 2 ( World Population: Case Study E) 

The following first-order ordinary differential equation formulation 

dn(f)/dr = a\L — n(t)} 

with n(f 0 ) = n 0 specified, is an abstract formulation which makes no sense outside mathematics. 

By associating the variables and the parameters \a , L\ of the formulation, on a one-to- 

one basis, with the variables and parameters of a system characterization, as indicated below, the 
above formulation becomes a mathematical model for population changes: 


Formation System characterization 

t Time 

n(0 Population at time t 

a (Birth rate)—(Death rate) 

L Limit on population 


A different way of looking at this is to view the abstract mathematical 
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, pvii(K no sense outside mathematics. By 
formulation as a dummy which . . f h f ormu lation with the 

- —*.. 

mathematical model. 

4.1 Empirical vs Theoretical Model 

iiiSi=:S=S 

rSsSitrSiHSSS 

" . u i, lc : n thi , cateeorv are sometimes called empirical moaeib . 

"in this book we will not dilfcrentiale between theoretical and empmea 
mathematical" models, bu, simply call all of there -ma.hereattea, models . 

4 2 Classification of Mathematical Models 

as follows: 

Canary A: lb,reflations suitable fot modelling deterministic static 
Caw B: Stations suitable to, modelling determ,nistic dynamic 

systems. 

• „o„ h* further subdivided. We discuss formulations 

Each of these ea.egones can be funhet suMtv C and D in 

belonging to categories A and B in Chapter o, anu 
Chapter 7. 

5 Analog and Simulation Models 

5.1 Analog Models 
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follows: “A real system S x is an analog model of another real system S 2 (or vice- 
versa) if and only if the mathematical models of the two are identical in terms of 
the mathematical formulation used to build the models." This implies that 
there exists a one-to-one correspondence between the variables of the system 
characterization of S { and S 2 . We give a simple example of illustrate this. 


Example 3 

A mechanical system can be characterized in terms of objects such as masses, springs and 
dashpots, using variables such as force, displacement, mass, spring stiffness and damping. Using 
the well-known laws of dynamics we can obtain a mathematical model involving an ordinary 
differential equation formulation as a dummy. An electrical network consisting of capacitors, 
inductors and resistors can be built such that the resulting mathematical model uses the same 
ordinary differential equation formulation as a dummy. In this case, each is an analog (model) of 
the other, and there is a one-to-one correspondence between the variables as indicated below. 


Mechanical system 
Force 

Displacement 

Mass 

Spring stiffness 
Damping 


Electrical network 

Voltage 

Charge 

Inductance 

Capacitance 

Resistance 


It is important to note that the validity ofS, and S 2 as analogs of each other is 
often true only under certain conditions. In the above example the analogy is 
valid only at low frequencies. 

5.2 Simulation Models 

A simulation model of a system is another system that imitates the latter in 
some respect, such as in shape or function. 

Since the goal is to simulate only certain aspects of the given system, the 
building of a simulation model often requires only a scant knowledge of the 
system being simulated, as illustrated by the following example. 

Example 4 

A quartz mercury lamp simulates the sun. in the sense that spectra of the two are similar. 

Simulation is a crude and deliberate imitation. It is dictated more by 
pragmatic considerations. Thus a simulation model of the kind described 
above contains less information of the system than an analog or a mathemat¬ 
ical model of the system. 

With the advent of computers the term "simulation" has acquired a special 
meaning, for in that context it refers to the relationship between a mathemat¬ 
ical model and its implementation on a computer, as shown in Figure 4.1. In 
this usage, "simulation" is a relationship that concerns the faithfulness with 
which the computer carries out the instructions intended by the mathematical 
model. In executing these instructions it is mimicking the system. This topic is 
discussed in detail in Chapter 9. 
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Figure 4.1 Moticlmodelling. simulation. 


6 Mathematical Modelling _ 

A. indicated earlier,an adequate™ 

— > 

^'ingenlral, one seldom 

tz 

ESSSX^ parses the tes, themodel canjhen be said to be an 

adequate model: if no,, the i,., alive P~°? “ ^1 ,, is difficult ,o 

There is yet no theory of mathematical Each „ ew modelling 

envisage the evolution of such n' eo ^ 1 . . t h a t there is no procedure 

exercise poses new challenges. This ocs ^ a bas|C met hodology which is 

commor^u^au'r^t'henwticarmt^ening^exerches^^Ve^sha^l^fiint^prt^ent^a 

m^n^ste^Thir^foUcwedb^ainore^etailed description which highlights the 
different topics needed for modelling. 


6.1 Mathematical Modelling Process (Simplified) 1 

The mathematical model building prOC ^^ of four 

as an iterative multistep process as shown in Hgure 
steps, and we briefly discuss each of them. 

** I »- ch—lion,, mI in the gggX 
and can be viewed as a process of siinplil cation an 
execution of this stage requ.res a deep understand g f^ P 
of the system (associated with the rea world of th€ f? n .he 'first stage a 
Step 2 (Mathematical model). This invo .wtw** f g formu i at i 0 n selected is 
mathematical formulation is selected. The type ot lormu 
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Figure 4.2 Mathematical modelling process (simplified). 


sometimes dictated by the theory used in the system characterization, and 
when no such theory exists it is selected on an ad-hoc basis (i.e. based on the 
model builder’s intuition). In the second stage the variables of the 
formulation are related on a one-to-one basis with the variables of the 
system characterization. Thus the “dummy” (abstract formulation) is 
clothed in terms of the physical features to produce a mathematical model. 
This step involves strong interaction between the physical aspects of the 
system (associated with the real world of the problem) and the abstract 
world of the mathematical formulation. 

Step 3 (Analysis). In this, the relationship between the mathematical formul¬ 
ation and the system characterization established in Step 2 is temporarily 
discarded. This is because physical aspects of the system play no role in the 
analysis of the formulation. The analysis is abstract, and done according to 
the rules of mathematics. 

Step 4 ( Validation ). In this, the analysis of the formulation is interpreted in 
terms of the relationship established in Step 2 (and temporarily discarded in 
Step 3) to yield the behaviour of the mathematical model. This behaviour is 
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compared »i.h .hebeha.iou,of.be system 

if,he mode] fails ,he ««>>'»"'"Ich-Tmct ^ 
must be rejected. When such a si ua h tcr j zat j 0IU the mathematical 

make changes - either in . the S J* must be done judiciously so that the 

an adequate mathematical model. 

"h:?irl e .ical“ g "o:iy a . 00 , for so.v,„ 6 reahworld prob,ems 
and not an end in itself.) 


6.2 Validation Criterion 

T^r^ 

agreement bets.ee,, the mode, and •^^^“Sza.ion and a 
complex. This follows since mb y dedtopro duce an adequate 

complicated mathematical formulation will be ne P model 

model. On the other hand if the crUenon mathemat- 

built on a coarser system charactenzat Astringent validation 

ical formulation may produce an f d X hav ^match ng"n detail, but this 
suitable I trade-off btriweenthe degree of agreemen, between the system and the 
characterization and mathematica °™“ behaviour either the criterion 

must^weakened wlh^mrKle? made more complex so ritm the tworesfmns« 

S^»oTct^ 
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Figure 4.3 The model building process (detailed). 


the art aspect of modelling, which requires critical evaluation and decision 
making. 

6.3 Mathematical Modelling Process ( Detailed ) 

The modelling process shown in Figure 4.2 does not illustrate all the features 
of mathematical modelling. Figure 4.3 shows the modelling process in greater 
detail. It shows more completely the different activities needed for successful 
modelling. It also shows the demarcation between the physical world of the 
system and the abstract world of"the formulation and the interaction between 
the two. In this section we describe the modelling process shown in Figure 4.3, 
and comment on the different topics identified there. 

The starting point is the real-world problem as indicated in Chapter 2. The 
problem is then transformed into a study of a system with the goal of the study 
defined. Describing the system, based on prior knowledge of its behaviour, 
produces the initial system characterization. This topic was discussed in detail 
in Chapter 3. 

Relating the system characterization to a mathematical formulation 
produces a mathematical model. To do this one needs an understanding of the 
different types of formulation available. We discuss mathematical formul- 
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ations suitable for modelling deterministic systems in Chap . 
modelling stochastic systems in Chapter • . , de , one nee ds to 

In order to obtain the behaviour done by using 

assign numerical values to the P ara /" e ers world) in a process called 

system behaviour da,...(or <" “ '“chap.eHO borh fo, 

SS^Sas^oS^e.hisisdope ihe mohe, behaviour 

Ca The analysis depends in the structure of^the formulation used in ‘ 

Broadly speaking the analysis can '*• Qne can j gnore the corre- 
and computational. To carry ou svs tem characterization as the 

spondence between the formula m In Chapt ers 6 and 8 we 

discussed i n chapters 5 and 7 respec ‘ 

tively; in Chapter 9 we= discuss the= topic of syslcm charac teriz- 

atton am^or paramete^stim^hw. * n u* we discuss 

a„ P h .he paramerer 

results of the analysis. 


Notes and References 
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No two definitions of the * , he rolc an d usage of models is varied. 
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modelling and the teaching of modelling. Chapter 18 contains a list of such 
articles. Articles on modelling differ in presentation but the underlying 
methodology is more or less the same. In contrast, the methods advocated for 
the teaching of mathematical modelling are greatly varied. 
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Mathematical Formulations I 


1 Introduction 

ln the 3 " 

- on t? e second step , 

relating the system character '°" s cn 0 i ^a ^od understanding of 

formulation. In order to carry ou P .ivailable and a high level of 

the different types of tnalhemattcal formula mnsavmUbfe and a^ ^ ^ 

types of formulations separate y. ?. me re | evant aspects and finally 

introduction to the general struc ur . . Case Studies 

in Chapter 3. 

2 Classification of Mathematical Formulations 

and dynamic formulations. 

2 1 Static Formulations 

Static formulations are ma.hema.,cal for m ulanons,n,o,v,„ge,.h« algebra. 

S——- between the 


MATHEMATICAL FORMULATIONS - I 


65 


variables of the formulation and the physical variables of the system 
characterization. Such formulations are suited to modelling deterministic 
static systems. 


2.2 Dynamic Formulations 

Dynamic formulations involve two types of variables - dependent variables 
and independent variables. In the simplest type formulation there is only one 
independent variable and the dependent variables are functions of it. When the 
formulation becomes a model the independent variable is called time, and is 
denoted by t. The formulations are characterized by equations involving either 
the derivatives of the dependent variables with respect to time (when time is 
considered to assume a continuous range of values) or the dependent variables 
for more than one discrete value of time (when time assumes only discrete 
values). 

More complex formulations involve two or more independent variables. 
The formulation is characterized by equations involving either the partial 
derivatives of the dependent variables with respect to these variables or the 
dependent variables for more than one set of discrete values of these, 
depending on whether the independent variables are treated continuously or 
discretely. In formulations involving four independent variables the independ¬ 
ent variables are usually denoted by r, x, y and z. When the formulation 
becomes a model, t corresponds to time and the remaining independent 
variables correspond, say, to Cartesian spatial coordinates of the system 
characterization and there is a one-to-one correspondence between the 
dependent variables of the formulation and the physical variables of the 
system characterization. 

Based on the nature and the number of independent variables in the 
formulation, we can divide dynamic formulations into four types, as indicated 
below: 


Number of independent variables 

Nature of independent variable 


Discrete 

Continuous 

- 1 

Bl 

B2 

> 1 

B3 

B4 


2.3 Formulation B1 

In this formulation the dependent variables are functions of one independent 
variable (t) which takes on only discrete values: r j, r 2 , r, Without loss of 
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general,,y w= can assume ,b, suUabJe scalmg, lha, . 

values. Thus Ihe mdependen, v»n»bfc ot ,h,slype 
sequence xli) or x„ with I »l« » 8 0 < j s m. and are called difference 

"lirrS formulations are soiled for modelltng systems 
with discrete time characterization. 

2.4 Formulation B2 

'FoSSns’SonU <o «* «**»» can be dtvtded into three types: 
Ordinary premia, 

(2) /"’ ]fjraU^aS^latio»s These are given by e <TJ ations , J volv,ng the 
dependent variable x(r) an g These are given by equations 

,3, SS 

derivatives with respect to t. 

Such formulations are suited for modelling systems with continuous time 
characterization. 


2.5 Formulation B3 

These formulations are similar ,o Z£Z 

Iomrn a r S va[u e e^ n ,he S^. variables. Such formulations are 
called multi-dimensional difference equation formulations. 

2.6 Formulation B4 

In this formulation the dependent variables are ^rang Tot values. 

indicated in Section 7. 


MATHEMATICAL FORMULATIONS - I 


67 


3 Static Formulations 

As mentioned earlier, static formulations are mathematical formulations 
involving either algebraic equations or function optimization. The former type 
is used, for example, in fitting curves or surfaces to a set of data points 
generated by a system, with the system characterized as a “black box”. The 
latter type is used mainly for optimizing system operation. 


3.1 Algebraic Equation Formulations 

The typical form for such a formulation is given by an equation 

G(*,.y,0) = O (1) 

where G is a vector function, x and y are vector variables and 0 is a set of 
parameters. Often, x corresponds to input variables and y corresponds to 
output variables when the formulation is used to model a system characterized 
as a “black box”. 

The equation is linear if x and y appear linearly in G, otherwise it is non¬ 
linear. The variables and/or the parameter can be either constrained or 
unconstrained. 

3.2 Case Study A [Cont.] (Weight Lifting) 

The system characterization for this problem was done in Chapter 3, Section 
6.1. In this section we build a sequence of mathematical models which relate 
the weight lifted (L) to the body weight of the lifter (IT) leading to a formula 
which can be used to calculate the handicapped weight lifted. 

Model A.l 

We assume that the relationship is given by 

L=kW x (2) 

where k and a are constants. Then, 

logL= logk + a log W (3) 

The handicapped lift L is given by 

U = L/W* (4) 

with k and a obtained by fitting (3) to the data given. 

Note that in the above model we have treated the system as a “black box”. 

Model A.2 

This model is an improvement over the earlier one in that it is based on simple 
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such the system is no longer treated as a “black 


physiological arguments. As 

assume that the lift is proportional to ° f 

the lifter's muscle. Let ft represent thts area. Thts ytelds. 

L=M () 

body length scale, typical body length, then 

A = k 2 l 2 (6) 


Let I represent a characteristic 


and 


W=k 3 l 3 


since body weight is proportional to body volume. 

As a result, we have from (5)- (7) 

L = ki k 2 i*=k t k 2 m>) 2 / 3 = 1 w 2 ' 3 


(7) 

( 8 ) 


where k = k t k 2 k 3 2,3 ■ 

This is similar to Model A.I. except that -2/3 and the only unknown 
P wfth« model the handicapped lift is given by (ft) with a - 2/3. 


Model A.3 . , 

This mode, generalizes Model A.2. The lift is related ,0 average cross-secttonal 

area by 


L=M 81 , 0<01 < 1. 

implying loss in efficiency for large size. Similarly, 

A = k 2 l e \ 0 < 02 < 1 

implying variations in shape, and 

W= W 0 + K 3 l 3 


(9) 


( 10 ) 


(ID 


implying .ha, the tota^ body 

muscular weight and W x (=K 3 l ) tne muscu. 

from(9)-(ll) .... 

\jv- law- W,<f y ’ 


where K, p and W 0 are three unknown parameters. 
The handicapped lift is given by 

L = L/(W- Wo? 

with the parameters obtained using the given data. 
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3.3 Function Optimization Formulations 

The typical form for such a formulation is given by 

Maximize f(x,0 ) (14) 

X 

subject to a set of constraints of the form 

G(x, 0)> 0 (15) 

where / is a scalar function, G a vector function, x a vector variable and 0 a set 
of parameters. 


The above formulation is quite general. Any minimization problem or 
constraints of the form G(.v, 0) 0 can be reformulated into a form given by 

(14) and (15). For example. 


Minimize h(x,6) 

X 

(16) 

is equivalent to (14) under the transformation 


/(x,0)= -h(x,6) 

(17) 


A similar transformation can be performed on the constraints to get them 
into a form of equation (15). 

Some special cases of the formulation are: 

(i) Linear programming formulation: x real and both / and G linear in x. 

(ii) Non-linear programming formulation: x real and either one or both / and G 
non-linear in x. 

(iii) Integer programming formulation: x integer. 

3.4 Case Study C [Cont.] (Optimal Production) 

The system characterization for this problem was done in Chapter 3, Section 
6.3. We assume no carry-over from week to week; hence each week can be 
treated separately. We use the symbols as defined in the system charac¬ 
terization. 

Model C.l 

Consider butter production. We have 

Z, = T l Y l (18) 

which relates the milk used for butter production and the butter produced. 
The cost of producing this butter is given by 

CiZ x ( = c x T x Y x ). (19) 

Since there is no carry-over, there is no sense in producing more butter than 
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what is demanded. This translates into 

z, = t, y.^fv 

(20) 

Similarly, we have for cream 

z 2 = t 2 y 2 ^d 2 

(21) 

with an associated cost 

c 2 Z 2 = c 2 T 2 Y 2 

(22) 

and for yoghurt we have 

Zj = Tjyj<D, 

(23) 

with an associated cost 

c 3 Z 3 -c 3 T 3 Y 3 . 

(24) 

For the processed milk we have 

z 4 = t 4 y^^d 4 

(25) 

with an associated cost 

c 4 Z 4 = c 4 T 4 y 4 . 

(26) 

The total cost of processing is equal to 

t cJ.Y, 

i = 1 

(27) 

and the total revenue generated is given by 

i *z,-1 

i = i •=* 

(28) 

In addition we have a constraint given by 

t y.<* 

i = 1 

(29) 

and the cost of producing milk equals 

c 0 * 

(30) 


As a result the total profit is given by 


/(Y) = £ (Si - c i) T * y < ” 

i= 1 


(31) 


subject to constraints given by(20U2U (23) (25) » nd ^9)^ ^ ^ obta in 
This is of the form given by (14) and ( n ) a v y . < is ; 4 appear 

optimal T( = y, V 2 y 3 yj to maximize / gl ^ n ^ (3 ^ n J raint ^ ^ formulaUon 
linearly in both / and in the various inequahty constraints, 
used in modelling is a linear programming formulation. 
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If the selling price s, is a function of V; (I < / $ 4), then f is non-linear in the 
variables y f (1^X4). As a result the formulation used is a non-linear 
programming formulation. 

4 Difference Equation Formulations 

We start with formulations with one independent variable. The variables of 
the formulations are sequences of the form Z(k) (or Z k ) with Z the dependent 
variable (either scalar or vector) and k corresponding to discrete values of the 
independent variable f. If the discrete values assumed by f are equi-spaced, 
then Z k corresponds to Z(t ) with r = Ak, A being the interval of discretization. 

A general difference equation formulation with one independent variable is 
given by an equation of the form 

Z k + i = 9 k(Zk’Z k - 1 ,-■ ■ Z k - ni< ; u k ,u k . l ,...u k . mh \ 0 k ), — oc < k < oo (32) 

where Uj{ — oo <j < o o} is a known sequence. The structure of the equation 
is given by the function g k \ the order by the integer n k + I and the parameters 
by (V In the above formulation the structure, the order and the parameters 
change with k. 

Such a general formulation (i.e. the structure and the order varying with k) is 
of limited use in modelling as it is too complex. A simple formulation is one 
where these do not change with k, and is of the form 

Z k +1 = g(Z k ,.., Z k _ n , u kt .., u k _ m . 0 k ) (33) 

with the order of the equation being (n + 1). 

Note that, in the above formulation, there are no terms of the form Z k4j or 
u k+j ,j > 0. This is important, because when the formulation becomes a model, 
k corresponds to time and Z k corresponds to physical variables of the system 
characterization. Thus, terms such as Z k+J and u k+j imply that the future 
affects the past. This is meaningless, as it would violate the “principle of 
causality”. 

4.1 Classification of Difference Equations 

If the function g is linear in Z k _,(0 < n) and u k _ ,(0 ^j^m) then it is called a 
linear difference equation. If not, it is called a non-linear equation. Non-linear 
difference equations can be subdivided into different types - e.g. bilinear, 
polynomial, etc. 

If the parameter 6 k does not change with k, then the equation is said to have 
time-invariant parameters; if not, it is said to have time-varying parameters. 

4.2 Alternative Representation 

The formulation given by (33) is a difference equation of order (n + 1). By 
defining an enlarged vector Y k as follows: 
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we can rewrite (33) as 
with 
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Y k = [Z k ,Z k -i . 

Y k + l = G(Y k ,U k ,O k ) 


U k = [U*. «*-l. 

ence equatioi 
jation of ordi 
is better suited for computer analysis. 


(34) 

(35) 

(36) 


This process allows a difference equatmn of anyorder to be red atjon 

dimensional difference equat.on of order one. This alternat. P 


4 3 Existence and Uniqueness of Solution 

For the solution to exist (i.e. a sequence 

function GW must be suitably constrained UCM » “ ueness of the 

values of its arguments, then a solution will ex st. I o en H h 

solution one needs to specify some ««(U * = 0), 
conditions are given by V k being specified at some,sing |fe(j b 

value problem. 


4 4 Multi-dimensional Difference Equations 

In the case of two-dimerisioral^iffu^^®9“ a l*bVj°^ u k*' < ^*^j^ e indexed 

independent variables and^ h d ^ ^ firs , ln( iex(It)corresponds 

sequences of the form Z kJ .W . { soatial coordinate when 

,o time and .ha, rhesecond tadcx(llcormsp,>nd “ kof causa , iIy - , s 

.o )■ A typical first-order formulation « 

given by 

Z..| .;-9fZt.j.2,./-" Z *J»i ; "*•» ,I,J * 

with a,., being a specified double-indexed sequence 

To ensure the existence and umquene.of n ' h .'; 0 '“"“, | ial conditions 
constrain the function g and specify two se . l0 jh a t for partial 

sssasr:ssrr-—- - -- 

formulation. 
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4.5 Case Study E [Cont.] ( World Population) 

The system characterization for this problems was done in Chapter 3, Section 
6.5. We consider the discrete time characterization with the time interval of 
discretization being a constant A. We first consider modelling the total 
population through a single variable, and later we consider a more refined 
characterization which involves spatial grouping. 


Model E.l 

Let /V,( = N(i )) represent the total population at the end of the /th interval. If B, 
and D, arc the births and deaths in the interval /, we have 

N i = .N i . x ^B i -D i (38) 


We ignore the gestation period and assume that B, and D ( are proportional 


to N.-i, i e.: 


(39) 

and 

D t — dN,- t . 


As a consequence, we have: 

1 

II 

(40) 

with 

-5S 

1 

-c 

+ 

II 

(41) 


Thus the model uses a first-order linear difference equation as a dummy. The 
initial condition is given by specifying /V, for some specific i = / 0 . 


Model E.2 

This is an improvement over the earlier model in that it includes the gestation 
period. Let r be the gestation period and we assume that A « r. As a result we 
need to introduce parameter k such that 

A k * T. (42) 

In this model, then, equation (39) is modified thus: 

Bi = nNi- (43) 

and 

D, = dN,. t . 

This implies that births in interval / arc proportional to population in interval 
I - k). From (38) and (43) we have 

N f = (1 -d)N i , x ^hN i . i . k (44) 

implying that the model involves a linear difference equation of order (k + 1). 
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In this case the initial condition is specified by giving N, over an 
i 0 ^ I ^ *o+*- 


Model E.3 


Model E.1 is based on a coarse 
population is characterized thr ° ugh earth is divided into J regions 

refined characterization is one where the P anet earth « ^ ^ . 

with N i} representing the P°P U ^ Qn f he n population changes in each 
If we ignore gestation dnd ® in _ uations ofthe form given by (40). 

Howevw”tteTnWresting case is where rhere ,s migra.ion Iron, one regron to 

another. „r„„r.nle migrating from region j to k( ¥=j) in 

Je for region*. 

changes given by j (45) 

u __ * \i . — > 11 + / . J i 


changes given uy ' , £ (45) 

Ny.tjNt-uj- I/« + £/' 
k*j **J 

with d^l+hj-dj (46) 

'l^tr^r^^gr^ Son oU in .he irh inrerva, 
cannot exceed Ni-ijwe have 

j (47) 

simple form is as follows: g 

= 0 

otherwise with ./ . . (49) 

,,>0.nd £ .,<1 forlSjSJ. 


IjH r' V -- L—t 

S condilions are speeifred b, giving ». 
for L<j ^ J. for some SP 60 ' 1 * 0 1 ~ '°- 
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5 Ordinary Differential Equation (O.D.E.) Formulations 

A general ordinary differential equation is a mathematical formulation of the 
form 

ot(t)D iH, :(t) = (l{:(t),Dz(t),... £> ,n -"r(f); u(f),Z>u(f),..,D( m -"«(/).0(0J (50) 

where D is the derivative operator, i.e. D:(t) = dr(r)/df and D ,n, :(t) = 
D(D in ~ n z(f)); w(r) is a specified function which is differentiable m times; z(t) is 
the dependent variable (either scalar or vector); 0(f) and 2 (f) are the parameters 
and f is the real valued independent variable. The interval of interest is given by 
T. Without loss of generality we assume that 0^f < T with T being either 
finite or infinite. 

When the formulation becomes a model, f represents time and r(f) and u(t) 
the variables of the system characterization. 


5.1 Classification of O.D.E.s 

The equation is said to be of order n if the highest derivative of the dependent 
variable is of order n. 

If the parameter 2 (f), the coefficient of the highest derivative of the 
dependent variable, is non-zero over the interval of interest, then the equation 
is said to be non-singular ; if not, it is said to be singular. In the case of the non¬ 
singular equation we can set the coefficient of the highest derivative to one, by 
dividing both sides of the equation by 2 (f). 

If i^(/) is equal to zero over the interval of interest and 0(f) does not depend on 
f, then the equation is said to be autonomous ; if not, it is called non-autonomous. 

If the function g is linear in r(f) and y(f) and their derivatives, then it is called 
a linear equation ; otherwise, it is called a non-linear equation. 

If 0(f) does not change w ith t then the parameters of the equation arc said 
to be time invariant. If not, they arc said to be time-varying. 


5.2 Alternative Representation 

The formulation given in (50) is of order n. Assume that 2 (f) = 1 and define an 
enlarged vector y(f) as follows: 


T(r) = [r(f),D_-(f).D‘"-"r(i)] r . 

Then (50) can be rewritten as 

DYU) = G(Y(i).U(t)JKt)) 

where 


(51) 


(52) 


l/(r) = [u(r),Dw(f).D'""M(f)] r . (53) 

Thus the equation is reduced to a first-order differential equation with the 
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S.3 Existence and Unique-** of Solution , 

For the solution to exist (i.e. a function ( £‘ mus t be suitably 

over the interval of interest) the ^"^Hne needs to specify some 
constrained. To ensure that the solutio ^ ar£ gjven by Y (t) being 

extra conditions. In the case o ( ). (hen lhc formulation is called an 

specified at some single value of ^ ^ of Us values at two or more 

a two. o, multi-point boundary value formula.,on. 


5.4 Case Study D [Cont.] (Rocket Launch) 

is not significant, and hence can be ignored. 


Model D.l 


shonmTn F^grme*3.U, t vv« S l'a^ r f^^ r f^ e *^ l °"'^ l " a ”*^ re< 


Mx”(t) = F{t) -G-f 


(54) 


where F is the thrust 

the earth measured outwards. itationa i attraction we have 

From the inverse square law for gravitation 


M(/ 0 K 2 

(xUf+R) 1 


(55) 


where R is the radius of the earth ^^velocity xXOas well as the density of 
The frictional force / depends on the velocrty x ( " for f is 

air, which varies with the position x(r). A general 

/=/(x(f),x'(0) 1 

W= assume tha, the density variation can be *£*1. and >h». the (ncuon. 
force varies linearly with the velocity. This results 

f=kx'(t) 


W *UweVssumethat the thrust generated lasts only for a very short time we can 
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treat it as an impulse, resulting in 

F(t) = F m 6(t) (58) 

where S(t) is the Dirac delta function. Note we have assumed that the rocket is 
launched at / = 0 and as a consequence we have the initial conditions given by 

x(0) = 0 and x'(0) = 0. (59) 

Thus the model is given by 

Mx"(t) = F m S(t) -(60) 

1 m (x(f)4-K) 2 

with the initial condition given by (59). Note that the model involves a non¬ 
linear differential equation formulation. 

5.5 Case Study E [Cont.] ( World Population) 

The system characterization for this problem was done in Chapter 3, Section 
6.5, and in Section 4.5 of this chapter we examined a few models based on 
discrete time characterization. In this section we discuss modelling the system 
with continuous time characterization. 

Model E.4 

This is the continuous time counterpart of Model E.l. Let N(t) represent the 
total population at time t. In the interval (M 4- 6t ), N(t) changes due to births 
and deaths. As a result, we have 

N(t 4- St) = N(t) 4- aN(t)St - bN(t)St (61) 

where b and d are the birth and death rates, and are constant. 

In going through the limit, we have 

lim { N(t 4- <50 - N(t) }/St = N'(f) = (b - d)N(t). (62) 

* 1-0 

Thus the model is given by a simple first-order linear differential equation of 
the form 

N'(t) = a N(t) (63) 

with 

d = b — d (64) 

and the initial condition by specifying N(t) at some time instant t 0 . 

Model E.5 

In the earlier model we assumed that both the birth rate and the death rate are 
constant. As a result a, given by (64), is a constant. A more realistic formulation 
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general non-linear equation of the form 

where 0 is a *. or paramos. ™ S .napl.es .hauhe mode! is g .»en b, a pop- 
linear differential equation 

N'(t) = f(N(t),0)N(t). 

Various forms for fW)> 0) can be used. We indicate a few: 

(Ol) 

(i) f(N(t),0) = 0 o {L— N(f)} 

(i\)f(N(t),o)=[Oi-p^gmt)} (69) 

(iii) f(N(t),0) = 0 2 {L~ N(t )I 


,n the case of (i) and (in) we have Wh tilal p^puTadon 

condition is speeded by MW a. *>me tin.. ■»*•« 

* 0 - 

Wc now consider modelling when the stam^* ar "'"“|'°"|e a „d female 
the two sexes. The the death rates for the males and 

similar to tha, in Mode, 

E.4. we have nm 

NJt) = - d m NJD + »A(N Jf), N/D) 

S'/t) = - d f Nfl) + (< - *) A (N m U), N/10) j ( 

and the total population at time f given by 

N(t) = NJl) + Nj(t). 1 

If * = 0.5. then half the new births are males. mo del. Tw0 

One needs to specify the form of A(N JO. *M'>) to comp 
forms are as follows: Jj 

(i) A(N m (0. N /(f)) = m in ! A m (0. and J 

(ii) A(N m (0.Ny<t)) = ^v / Am(0N / (0. . gWe n 

i . x being constants. In cither case the resulting formulation g 
by a°pafr oTcoupled non linear ord.nary d.lferential eqna„ons. 
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The initial conditions are given by N m (t) and Nf(t ) for some specified 
value t 0 . 


6 Differential Difference Equation (D.D.E.) Formulations 

A fairly general first-order differential difference equation formulation, with 
one dependent variable, is given by 

At) = f (2(f), z(t -ij), z(t- t 2 ), .., z(t - r p );u(f)). (74) 

The parameters x ly t 2 ,..,t p can be either functions of t (the independent 
variable) or constants. u(t) is a known function, r, ^ 0 (1 ^ ^ p), in order to 
ensure that the “principle of causality" is not violated when the formulation 
becomes a model. 

Higher order D.D.E. involve higher derivatives of z, and using the technique 
of Section 5.2 one can reduce it to a vector first-order D.D.E. With two or more 
dependent variables the formulation is similar to (74) comprising a set of 
coupled equations. We shall omit the classification as it is similar to that for the 
ordinary differential equations (see Section 5.1). 

To ensure that a solution exists the function / must be constrained, and to 
ensure that the solution is unique one needs to specify z(t) over a non-zero 
interval of t which is longer than the largest value of z i9 1 ^ i ^ p. This is in 
contrast to the ordinary differential equation formulation, where for a first- 
order equation one needs to specify the dependent variable for a specific 
point f 0 - 


6.1 Case Study E [Cont.] (World Population) 

Model E.7 

Consider the continuous time system characterization for the problem 
discussed in Chapter 3, Section 6.5. We are interested in the total population 
N(t). This changes due to births and deaths. If we include the gestation period, 
then the population change in the interval (f,r 4* St) is given by 

N(t + St) - N(t) = {bN(t -t)- dN(t)}St (75) 

where b and d are the birth and death rates and t is the gestation period. 
This follows as births in the interval (f, f + <5f) depend on the population 
at (r - r). Going through the limiting process, we have the model given 
by 

N'(t) = bN(t-x)-dN(ti (76) 

involving a D.D.E. formulation of order 1. The initial condition is given by 
(0. -r ^ t ^ 0, so that the model can be used to obtain N(t) for t > 0. 
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7 partial Differential Elation (P.D.E.) Formulations 

Partial differential equation fonmOationsiivrolvethe^paJtia^ ^ ^ of mo , e 

L^derv"we Brit discuss lormoiation involving two independ- 
cnt variables. 

7.1 Formulations with Two Independent Variables 

Let x and t represent the ‘^^ ’"^^"Let^artSrdlrWatTves^ dented by 

-d so on. 

A gene/al P.D.E. with one dependent variable is given by 

f/(u, u x ,u,, u xx , u xr u„ . 0 (x,f),/(x,f ))-0 

a riw 11 i« \ known function. The order of 
where (H-x, 0> s a set of parameters an ( '‘ derivat i v e occurring in the 
the P.D.E. is given by the highest partial der.vat. 

'Tst'oTr, of the lormoiation one needs to deline the domain o[ .he 
independent variable, X and I. It could take vanous lorms. e.g.. 

(i) t^ 0 .L x ^x^L 2 , or 

(ii) t> 0 ,L,^x<co, or 

(iii) f>0, - oo < x < co 

where L, and L 2 are finite. tcm , corresponds to time; x 

correspondTto atpauaTcoordinate and ulx, ricorresponds to a variable ol the 
system characterization. 

7.2 Classification of P. D.E. w appcar |incarl> in „ 

If the dependent variable U( h TcVVZt i Sled" nZinear equation, 
then the equation is said to be .Unear l noun «^ 

A fairly general, second-order linear P.D.E. is g J 

au„ + 2 bu, x + cu xx + Ju, + eu x + <?« -fix,t) 1 

where the parameters a,b,c,d,e 
independent variables x and t. This equation 
types based on the sign of I", given y 

r = nx, o = h 2 (x, t) - a(Xs M*. o I 

At a pom, 1,0 in the domain o, the tndependen, vanable, .be equalion ,s 
called: 


(i) parabolic if T — 0, 
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(ii) hyperbolic if r > 0, and 

(iii) elliptic if T < 0. 

If the parameters a , b and c are constants then the equation is either parabolic 
or hyperbolic or elliptic over the whole domain of the independent variables. 


7.3 Existence and Uniqueness of Solution 

For the solution to exist, the function g must be suitably constrained. To 
ensure uniqueness one needs to specify additional conditions. 

The number of conditions needed depends on the order of the differential 
equation and the nature of the conditions depend on the range of the 
independent variables. 

In the case of a first-order P.D.E., one could specify the condition by 
specifying w(x, f) along a curve in the x-t plane. If it is of the form 

u(x,0) = r(.v) (80) 

where r(x) is a known function, then the formulation is called an initial value 
formulation. 

For a second-order parabolic equation a set of conditions which will ensure 
uniquencess is given by one initial condition (I.C.) and two boundary 
conditions (B.C.). For example, if 0 ^ t < oo and - x < x < x , the following 
conditions 


I.C.: m(x, 0) =/ 0 (x); — x < x < oo (81) 

B.C.: | u( y, /) | < oo as .v-> + x 


are sufficient to ensure that the solution is unique. 

For a second-order hyperbolic equation a set of conditions which will 
ensure uniqueness is given by two initial conditions (I.C.) and two boundary 
conditions (B.C.). For example, if 0^/<x and L,^x^L 2 , then the 
following conditions: 


/u(x,0) =/ 0 (x) 

\c?ii(x,0 )/dt =/i(x) 


E | ^ X ^ La 2 


( w(L,,f) 

\u(L 2 j) 


= 0i (0 
= 02(0 


0 ^ t < x 


arc sufficient to ensure that the solution is unique. 


(82) 

(83) 


^•4 Case Study E [Cont.] ( World Population ) 

In Sections 4.5 and 5.5 we examined several models based on different system 
characterizations. In this section we discuss modelling population changes 
w,l h a system characterization which takes into account the age structure of 
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the population and treats time as being continuous. This leads to a P.D.E. 
formulation. 


Model E.8 


Let N(f,x) be a variable charactenzing^thc age structure of t P P n by 

time r. More specifically, the populaton in the age group , 



N(l,x)dx 


(84) 


The birth and deulj^Chapter 5. The 

0 <. < * and 0 « x « where x. . 

the upper limit to human age. v + 8x\ then the total 

If we consider a small age interval - e.g. (X. x + M. 
population in this age group at time . ts given by 

N(x, t)£x + Ofdx 2 ) ,85 ’ 


ThUS mx.1+ »)**- mx-MSx-^x-M«x)Sx + Wx>). m 


The first term represents people in the age group ,x - dr. x, a. time , and the 
a tprm rpnresents the reduction due to deaths. 

S Tn eZS -ng Taylor series and taking the limit, we have 


<?N(x. 0 
dt 


mx. t) 
dx 


— d(x)N(x, f) 


or 


dN(x,t) dNjn. t) 

1l ^ 


- (f(x)N(x, t) 


New additions at time t are given by 


N(0, f) = 



b(x)N{x, t)dx 


(87) 


( 88 ) 


Et.ua,ion ,88) can be as » ““e have Nix..0-0. The 

x - X. (to ensure that no on y „ f or some l 0 - viz. 

initial condition is given by speciiying ^ 

N(x,f 0 ) = C(x), O^x^x, 

Thus, in this ease, the model involves a linear firs,-order partial d,detent, 
equation formulation as a dummy. 
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7.5 Case Study F [Cont.] ( River Pollution) 

The system characterization for this problem was done in Chapter 3, Section 
6.6. In this section we discuss building of a model using the simplest 
characterization where the river is viewed as a one-dimensional channel with 
constant cross-section. 

Model F.l 

Let T(.x, /) represent the concentration level of pollution at a distance x from 
the point of discharge at time t. From the system characterization we have 

- Lj ^x ^ L 2 and O^rcoo (90) 

which define the range of the independent variables. 

The concentration level changes due to diffusion and transportation. Let D 
be the coefficient of diffusion and let U be the velocity of the river which is 
assumed to be constant. The diffusion is governed by Fick’s first law of 
diffusion, which states that the diffusion current vector J is given by 

J = — DdT(x , t)/dx (91) 

From the equation of continuity for the diffusing substance, we have 

ITM_ t ,fT(x, t) yifcQ 

St ~ Sx Sx‘ ’ 

This is obtained by considering a small volume centred at x at time t and 
evaluating the net inflow and outflow of pollution in a small time dt. The net 
change in the small volume can be related to the net inflow and outflow and in 
the limit <5f-+0 and dx ->0 we have the above equation. 

Thus, from (78) and (79) we see that the model involves a parabolic partial 
differential equation. We need to specify additional conditions to ensure 
uniqueness of the solution. These are given by 

I.C.: 7(x,0) = G(x), - L, ^ x ^ L 2 (93) 

indicating the concentration levels along the river at t = 0, and 

B.C.: T(— Lj, 0 = 0 

T(L 2 J) = 0 0<r < oo (94) 

The above model does not take into account the discharge of pollution at 
v = 0. This can be easily accounted for by modifying (92) to include an extra 
term F(x,f) which is of the form 

F(x, t) = oc0(O<5(x) (95) 

*hcre (/(f) is the time history of dumping and 5(x) is the Dirac delta function. 
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7.6 P.D.E. Formulations with More than Two 
Independent Variables 

The dependent variables are functions of equation is of 

and are of the f °™ derivatives with respect to all the 

b Te4hefo,l 1 a.ionsa re s.raigh,forward .x,e»sK,„sor 

the two independent variable case. 

Notes and References 

■ I no iKp theory of mathematical 
Function opfW**** formu a sma ,| sample of books on this topic 

—-— 

di D^nce equation formulations 

t^TlaTrtent'lt 1 w"b ^ ,0rmUla ' i0 " S 

in modelling physical differential equations can be 

The large number of books o belonging to the first category 

broadly divided into two categor _ exfstence and uniqueness of 

concentrate on theoretical aspects (fo Pj ( [, 4 ] an d [15]). Books 

solutions) and vary from introduct y concentrate on the methods 

belonging to the second category ^§7™‘^Siting book which not 

range of problems. , r251-r271. Such formul- 

For more on differential TO 

ations have been used extensiv > differential equation formulations; 

We have not discussed integral and integro-diliereniiaicq 

[29]-[31] give a small sample of b °° k ^ differential equations. As with 
There arc many books which discuss p conce ntrate on theoretical 

books on ordinary differential equal™’ 0 r°“| y “"S applications. Items 
aspect, whilst others deal w,th,he second category, la 
[32]-[38] are a small sampeofbf. ■ J n 8 with specific formulations - 

-fSiTtS "r'aSc e d ua,,o e n formulations and [«l - 
[41] deal with hyperbolic equation k™^ , °^ hematica i formulations 

--- - - 
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used to model the qualitative behaviour of a wide range of systems and 
phenomena. For an introductory presentation see [42] and for more details 
see [43]—[45]. 
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CHAPTER 6 

Analysis of Mathematical 
Formulations - I 


1 Introduction 

Once a mathematical model is constructed (by associating an abstract 
mathematical formulation with the variables of the system characterization) 
the next step is to obtain the behaviour of the model. This is done by the 
following three-step procedure. 

Step /: Discard the association (mentioned above) so that only the 
mathematical formulation is retained. 

Step 2: Carry out an analysis of the formulation using appropriate 
mathematical techniques. 

Step 3: Re-introduce the discarded association so that the analysis can be 
interpreted in terms of the physical variables of the system characterization 
to yield model behaviour. 

In this chapter we focus our attention on the analysis of mathematical 
formulations considered in the previous chapter. As such, we start with a brief 
discussion on the different types of analysis. 

2 Types of Analysis 

Broadly speaking, the analysis of mathematical formulations can be either 
qualitative or quantitative. As the name suggests, qualitative analysis deals 
with l ^ e study of qualitative aspects of a given mathematical formulation 
without explicitly solving it. This is done by an analysis of the underlying 
mathematical structure of the formulation. Typically, such an analysis 
provides answers to questions of the form: 

to) Is the solution of a given O.D.E. formulation periodic? 
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,„) Does the so,u,ion of 0 g,veo difference equa.ion for™!.,ion go .o eero.s 
the index n approaches infinity? 

The answer is usually binary valued,'^^^ne^with finding the explicit 
In contrast, quantitative analysts formulation. Quantitative 

solution which satisfies the given ma d analytical methods 

analysis can be dividedrole in mathemat- 

W “SrScronte can be fur,he, subsided as indicated beiow. 

2.1 Analytical Method of Analysis 

In this method the solution to a given the'ind^nd- 

terms of an analytical express,on *1* exac, or 

ent variables of the formula i • . n i llt ion satisfies the formulation (i.e. 

approximate, depending on whether lhe s exactl or approximately. 

the equations and the associated co J . possible to obtain 

Only for a small class of mathematcal^ ^ ^ ^ 
exact analytical solutions. A PP‘‘ ox,, " a an cxact solutio n. The approxi- 

extremely difficult, or impossible, solutio n of a modified mathemat- 

Sfol U uiatiorwhich 0 isdose to the original formulation in some sense. 

2 2 Computational Method of Analysis 

approximate and depends on e yp ter j ze( j solutions - that is, the 

Analytical methods of analysis yie P mcters Q f t he mathematical 

solutions are analytical expressions ^ of analysis yield solutions 

formulations. In contrast in the latter case, the 

for specific numerical values of the parameters. f values arc altere d. 

solutions have to be recomputed every 1 ana lysis are applicable only 

Another difference is that analytiral methods of anal^s^s a ^ .„ 

to a small class of mathematica . x tems contrast, computational 

determining model behaviour o ct P jd j 0 f mathematical 

methods of analysis are applicable to a much wider 
formulations. 

3 Types of Computers and the Nature of 

Computed Solutions f 

,n this section we firs, discuss .he different W»£Ration 

There are essen " a " y 


ANALYSIS OF MATHEMATICAL FORMULATIONS - I 89 

three types of computers digital, analog and hybrid. The essential features of 
each will be discussed in turn. 

3.1 The Digital Computer and Digital Computing 

A digital computer operates with real and integer numbers stored as binary 
digits. As a consequence the computer stores real variables only approxi¬ 
mately, because of the limited precision due to finite word length and 
continuous functions as sequences for discrete values of the argument. The 
operations are executed according to a pre-specified program - a set of 
instructions which the computer recognizes and executes sequentially. 

In general, the program is written in a higher-level language and the 
computer translates this into a binary pattern before it can execute it. For 
solving specific mathematical formulations (e.g. O.D.E. formulations) one can 
either use a general higher-level language (e.g. Fortran, Pascal) or a special 
higher-level language especially designed for solving specific formulations (e.g. 
CSMP for O.D.E. formulations). The former offers greater flexibility at the 
expense of greater programming effort from the user, while the latter requires 
less programming but is also less flexible. Some specific higher-level languages 
(also called simulation languages) will be discussed in Chapter 9. 

Because of the nature of the computer, the solutions of dynamic formulations 
with continuous independent variables can in general be obtained only for 
discrete values of the independent variables. We should mention, however, that 
there exist very high-level languages and programs (for example, M ACS YM A) 
which can perform symbolic manipulations on the continuous variables. The 
study of different methods for solving mathematical formulations on digital 
computers belong to the theory of ‘'numerical methods”. We shall discuss 
some specific methods for solving different formulations later in the chapter. 

3.2 The Analog Computer and Analog Computing 

Analog computers operate with variables represented as electrical voltages. 
The computer consists of a variety of blocks or components satisfying different 
mathematical relationships between the input and the output voltages - e.g. 
integration, addition to name only two. By linking the components physically 
(also called patching) the voltages of different components become related and 
satisfy the desired mathematical formulation. The solution is thus given either 
by the voltage, or time history of the voltage, of one or more components. The 
analog computer is ideally suited to solving dynamic formulations with one 
independent variable. The independent variable is represented as a continuous 
variable on the computer. When the formulation contains two or more 
independent continuous variables, only one of them can be represented 
continuously on the computer. In this case the solutions are obtained as a 
continuous) function of one independent variable for discrete values of the 
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remaining independent variables. Shim tbe UagK representing 

the formulation needs to be scaled so that the tec o( scallng is 

different variables do no, exce«i ‘^^Xnssmi in the next section. 

tbe, « used 

of an essentially digital system. 

3 3 The Hybrid Computer and Hybrid Compumg 

Tbe hybrid compute, is an analog £,I»of 

Tbe advantage of such »,*» 
both machines, and at the same ooeration of each machine is 

worst features o, ettber. senes vs parallel) a 

IS. -mumcation between the 

two machines. 


3.4 Changing Emphasis in Computing 

With the developments in digital ^^^[^p^kage^for solving different 
availability of a large number o c P ^ and hybrid computers is 
mathematical formulations, t e aspe cts mentioned above, which 

becoming less popular except or rem a ind er of this chapter we shall 

are still at the research stage. Hence ’ metho ds to those used in digital 

computing' "r 9^3 return to the use of analog computers in 
solving different mathematical formulations. 

4 Dimensional Analysis and Scaling 

Since a mathematical model ^^^odeT Canute vtewed^as^having physical 
variables and parameters of . b ic fundamental units such as 

dimensions and are cta«ct««d s usctu , [or , through it. 

mass (M). length I LI andI time V ^ w ensu re ,hat it is in agreement with 
°f - system Our .merest in this section is to use ,h,s 
information for dimensional grouping and scaling. 


4.1 Dimensional Grouping 


Instead of dealing with the ortgtnal variables 
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replaced by a new set, and the formulation restated in terms of this new set. By 
proper grouping the new set can be made non-dimensional, and if some of the 
new parameters can be identified as being small they can be ignored in certain 
instances. The advantage of this is that the resulting formulation may be 
simpler and hence easier to solve. Great care must be taken in this 
simplification process or else the solution of the simplified formulation might 
bear very little resemblance to the true solution of the original formulation. 
Successful dimensional analysis and simplification requires a deep under¬ 
standing of time and length scales associated with the variables of the system 
being modelled, and a well-developed intuition. This is illustrated in the 
following case study. 


4.2 Case Study D [Cont.] (Rocket Launch) 

In Section 5.4 of Chapter 5 we discussed the mathematical modelling of this 
problem. If the frictional force can be ignored and the thrust can be treated as 
an impulse, the model is given by 

M x "(f) = F m S(t) - Mc, 0 R 2 /(R + x(f)) 2 (I) 

where the primes denote derivatives. The initial conditions are given by 

x(0) = 0 and x"(0) = 0 (2) 

Because the thrust acts as an impulse, (1) and (2) can be rewritten as 

x"(r) = “ 9o r2 /(R + *(0) 2 (3) 

with 

x(0) = 0 and x'(0) = V = FJM (4) 

Let us assume that x(f)«K. In this case we can simplify the equation to 

x"(0=-</ o (5) 

By this simplification we have converted the non-linear equation (3) into the 
linear equation (5). However, we cannot be sure if the assumption x(r)« R is 
valid. This would depend on the value of K and it is not obvious as to what the 
relationship between V y y and R is to ensure that the simplification is valid. 

Towards this end, we use dimensional analysis to define non-dimensional 
groupings of variables to test the validity of our assumption. The dimensions 
of the various variables and parameters in the original formulation are as 
follows: 


Dimension 


Dependent variable 

X 

L 

Independent variable 

t 

T 

Parameter 

00 

LT~ 2 

Parameter 

V 

LT ~ 1 

Parameter 

R 

L 
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groupings. 


Gr^ns I* '-I**'' - *»* * WR The " ® “ 


d 2 y(x) _ 1 — 

£- dT r ’ __ (y( T ) +,)2 ’ 


y(0) = 0, 


y(0) = 1 


( 6 ) 


with 


£=V 2 /g 0 R 


(7) 


Grouping 2: 


Let t = r/^ST 7 = *»/« Then (3) can be rewritten as 


d 2 y(0_!_ 

~d?~ (y(t)+D 2 ’ 


y(0) = 0, y( 0) = e 


( 8 ) 


with £ as defined above. ... 

Grouping 3‘- U. t» W ^ 4^V' Th.nO.cnb.r.wn, 

as 


d 2 t(x) 

dr 2 


1 

(l +£y(t)) 2 ’ 


y(0) = 0, 


y(0)=l 


(9) 


with e as defined above. ignore £ and obtain an 

If V 2 «g 0 R' then £<< '• ,n th ac f® nd 2 are not useful. With these the 
approximate solution. The grouping re i at j on t0 the original solution. 

c , ose 10 that „f ,be 

However, grouping 3 yields a so ul her|Ilore , u leads to a linear 

original formulation whn •“ eq uatl0n which is non-linear) and 

equation (as opposed to the original q 
therefore much easier to solve. 


Sealing amounts to ^“““"“^“the'erm' MoriTomally.'i" the 

proMss of sca^mg cine attempts ^ a factor (which <**^1 

each term in the dimensional equs at . . and a dimensionless factor of unit 

estimates the term’s order of ^^^^Ition and x n the absolute 
magnitude. Thus if x(f) be a variabl with <f) = x ( f )/x m resulting 

maximum value of it. then it is tha n or equal to unity. The 

in a dimensionless variable y(t) of magn . . • dosely re lated to length 

selection of suitable intrms'c reference q considering the rocket 

and time scales of the system. We illustrate tms y g 
launch problem. 
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4.4 Case Study D [Cont.]: ( Rocket Launch ) 

Since we start with the assumption that x « R , we have the rocket move with a 
deceleration proportional to g starting with an initial velocity V. As a result the 
rocket velocity decreases and reaches zero value at x = x m the maximum 
height reached by the rocket. The time taken to reach this height is V/g 0 and 
the mean velocity during this period can be taken to be V/2. Thus the 
maximum altitude reached by the rocket is roughly V 2 /2g 0 . In other words, x m 
is of the order V 2 /g 0 . (Note: If we include the factor 1/2, we would get a more 
accurate estimate.) This suggests an intrinsic length scale of V 2 /g 0 . Similarly, 
an intrinsic time scale is V/g 0 . Using these intrinsic length and time scales, we 
can scale the variables as follows: 

X = t/VgS 1 and y(x) = x(t)/V 2 yo 1 (10) 

This is precisely the combination for grouping 3 of Section 4.2, which led to 
approximate solutions which are close to the true solution. 

5 Static Formulations 

In Section 3 of Chapter 5 we discussed two types of static formulations 
algebraic equation and function optimization formulations. In this section we 
briefly discuss some aspects of the analysis of such formulations. 

5.1 Algebraic Equation Formulations 

A general formulation is given by a multi-dimensional function 

G(x,y,0) = O (11) 

x and y being vector variables and 0 being a vector parameter. A typical 
problem in analysis is the following. Given x and 0 find y which satisfies (11). 
(This corresponds to finding the model output given the input and model 
parameters.) The solution y might not exist. If it does, it might or might not be 
unique. 

5.1.1 Analytical Solutions 

Only for a small class of formulations is it possible to obtain y analytically as a 
function of x and 0. One such is when (11) has the form 

4(x,% + B(x,0) = O (12) 

and 4(x, 0) is a non-singular n x n matrix. In this case a unique solution exists 
and is obtained by inverting the matrix A(x , 0). In the above formulation y is a 
j 1 x 1 vector which appears linearly in the equation. When y appears non- 
mearly, then there can be multiple solutions and one would need a 
computational scheme to obtain them. 
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512 Computational Solutions 

, . n for no tational convenience. Hence (11) will 

frt) , ) = y -/l( f )G(y) " Jl 

with M* a non-singular n * . main, Consider the equation 

y = 9(y) ' 1 

It is easily seen that (111 and (14) have the same set of solutions. The solutions 
fo, ,H) can be obtained by the following iteration seheme. 

y k +1 = </(yk)' k = 0,1,2,... 

w„h ,, an arbitrary initial fiZSZ Sf iSZSX. 

My)-J-M. wHere J(» is the a. 

the inverse exists. The latter has a f ‘ n an(J invcrsio n of the Jacobian 
S.TeSr p C o^“,on:0,her scheme* are obtained by diheren, 
choices for A(y). 


5.1.3 Reformulation of the Problem 

The solution to (11) can be ^ 3 ^ y to 

function minimization problem. Consider me v 

minimize H(y) given by 

H(y) = G T (y)G(y)= I lG,(y )) 2 (,6) 

„ . , <i<n are the components of G(y). Since H(y)» 0. the optimal 

where G,(>). I < ‘ < "•JL which results in H(y> being zero. This 

y which minimizes H(y ) nrnhlem will be discussed in the 

optimal y also satisfies (11). The minimization problem will oe 

next subsection. 

5 2 Function Optimization Formulations 

A .ypical formulation is « 

minimizes a scalar function f(x) often, calle^ the ^ we wrl ,e 

notational convenience we omit the parame real varjable . The case 
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form 


G(x)<0 or G(x) = 0 (17) 

we have a constrained optimization problem. If the constraint involves strict 
equality, then it is called an equality constraint; otherwise it is called an 
inequality constraint. As indicated in Chapter 5, formulation involving 
maximization instead of minimization, or a constraint G(x) ^ 0, can be recast 
into a form involving minimization and a constraint of the form given by (17). 


5.2.1 Unconstrained Optimization 

The optimal x* which minimizes/(x) must satisfy the following necessary 
condition 

J(x) = df(x)/dx = 0 (18) 

provided the derivative exists. In this case, one way of obtaining the optimal x* 
is to solve (18) using the methods discussed in Section 5.1. An alternative 
approach is to use computational methods to minimize J'(x) directly. 

5.2.2 Computational Methods 

Since J(x) given by (18) is the gradient vector, a computational method (called 
“gradient method") for finding the optimal x* is the following iterative scheme: 

+ t =**-**«J(**). * = 0,1,... (19) 

where t k is a positive scalar quantity and x 0 is chosen arbitrarily. t k can be 
either constant or vary with x k . If t k is chosen to minimize 

f(x k -tJ(x k )) (20) 

(t> 0) 

then the method involves a one-dimensional minimization at each step of the 
iteration. This approach is also the basis for a large number of methods. The 
two best known are the conjugate gradient method and the variable metric 
method. Under appropriate conditions x k will converge to the optimal x* as 

k x . 


5.2.3 Constrained Optimization 

The problem is to obtain the optimal x* which minimizes/(x) subject to the 
constraint given by (17). One approach is to convert the constrained 
optimization problem into an unconstrained optimization problem and use 
the methods described earlier. The penalty function method is one such. An 
alternative approach is via Lagrange multipliers where the constraints are 
adjoined to /(x). 
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5.2.4 Penalty Function Method 

In the penalty function method we minimize a function F(x,k) given by 

F(x,k)«/(x) + * 1 {max (G f (x),0)} 2 (2,) 

i = 1 

where k is a scalar > 0 and G,(x) are the components of G(x), i.e. ^ 

C r (x) = [G,(x), G 2 (x)..-• G m (x)] (22) 

For a given *. .be optimal x- obtained b, 

constraint, is a function of/c if one or more c . greater than 

lo the constraint being violated n As increased 

zero and reflects the penalty 'pod for ^ a , g so , ution to satisfy the constraint 
the penalty increases. This f P the optimal solution to the 

— -— 

solutions for the unconstrained formulation. 


5 2 5 Lagrange Multiplier Method 

rssss=S2Si 

function by means of an m-dimensional g t 
; 2 .;.J) to define a function F(x,A) g> ven b y 

F(x.A>=/M+J/tCM ,23) 

Fix, A) is a rune,ion of tt + ttt pj^prinia^valuc^of x 

function yields an optimal x and A . - — 0 The x* and 

which°minimizes /W subject to the equality consIntm CM 0, The 

A* satisfy the necessary conditions (for optimality) g y 

df(x)/dx+ lx i dG i ( X )/dx = 0 (24) 

(for optimality) given by 

m ( 76) 

df(x)/dx + £ X t dGi(x)/dx = 0 
1=1 


£;. i 3,(x) = 0 A^O, \<i<m 


( 27 ) 
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This optimal x* also minimizes f(x) subject to the inequality constraint 
G(x) < 0. The method works only when the Lagrangian multipliers exist. Their 
existence can be assured if the constraints satisfy certain conditions. The 
"Kuhn Tucker” condition is one such. 

5.2.6 Computational Methods 

In general, one needs a computational scheme to solve constrained formul¬ 
ations. A number of methods have been developed. They are all iterative in 
nature. Some are designed for specific forms of/(x) and G(x). One such is the 
well-known simplex method for solving linear programming formulations 
formulations where both f(x) and G(x) are linear in x. In a linear programming 
formulation the admissible region for x is a convex polyhedron and the 
optimal x belong to the set of vertices of the convex polyhedron. The simplex 
method is an iterative method which moves from one vertex to another to 
finally converge on the vertex which yields the optimal solution. 

5.3 Case Study C [Cont.] (Optimal Production) 


In Section 3.4 of Chapter 5 we formulated a model (Model Cl) for this 
problem. Using the notation of Section 3.4 of Chapter 5 and defining 



Y t = 

[T„ 

r* 

i—i 

** 


(28) 


l T = 

Oi 

— c { 

)- “ f 2 ). 

(S 3 ~ C i)’ («4 - c 4 )] 

(29) 


b T = 

[01. 

D t , 

Dy Dy *] 

(30) 



”T» 

0 

0 o" 





0 

T 2 

0 0 




A = 

0 

0 

T 3 0 


(31) 



0 

1 

0 

1 

o r 4 

i i 



the model can 

be writte 

n as 








max = l T Y—c 

Y 

0* 

(32) 

subject to 










a : 

Y^h Y$ 

= 0. 

(33) 


This is a standard linear programming formulation and can be solved by the 
simplex method. 

A large number of computer packages are available for solving linear 
programming formulations. They require the user to specify the entries for A, l 
and b. The packages contain subroutines for printing the optimal solution if it 
cx,s, s. and error messages if not. 
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6 Difference Equation Formulations 

the form .. a\ (34) 

with the initial condition given by an n-dimensional vector 

/ - \ 


6./ Analytical Methods 

invariant. 

6.2 Case Study E [Cont.] ( World Population) 

Section 4 5 of Chapter 5 we formulated three models (Models E. I to E.3) 
involving difference equation formulations. 

Model E. 1 
The model is given by 

N.-IN,.,. i>> 

wUh >, I +(.-<! ,,7) 

““ “S are eonsUnTfht solution can he obtained analytical,, and is 

e ivenby /v ( = S'N 0 (38) 

Depending on d, the solution N, either increases or decreases continuously 
with i, or else does not change at a . function of N,_,. In this 

case die equation iSSi» obtain an analytic solution 
depends on the nature of the non-linearity. 

Model E.3 

The model is given by 


N, = ( 1 —d)N i_ t +bNi- 1-* 


(39) 
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If b and d are constants, then the solution is given by 

Ni=I ' AjX‘ (40) 

j= 1 

where Xj( 1 ^k+ 1) are the roots of the polynomial equation 

A* +1 = (1 -d)X k + b (41) 

Aj, 1 ^ k + 1, are constants which need to be selected properly to satisfy the 

(k + 1) initial conditions. 

6.3 Computational Methods 

Difference equations are ideally suited for solution on digital computers. The 
alternative representation - i.e. reformulating a higher-order equation as a 
vector first-order equation (as indicated in Section 4.2 of Chapter 5), is 
desirable, as it results in the formulation being transformed into a first-order 
recursive relationship. 

7 Ordinary Differential Equation (O.D.E.) Formulations 

This class of formulations was discussed in Section 5 of Chapter 5. A general 
nth-order initial value formulation is given by 

D {n) z(t) = g{D {j) z(t), O^j^n-U D ij) u(t) y 0(f)} (42) 

where D is the derivative operator ( D (j) = d j /dt j and D i0) z(t) = z{t)) and the 
initial condition specified through z(t) and its first (n — 1) derivatives at / = 0. 

7.1 Analytical Methods 

As in the case of difference equation formulations, only for a small class of 
O.D.E. formulations can an analytical solution be obtained. One such is when 
the equation is linear and the parameters time-invariant. For some special 
cases of non-linear equations it is possible to obtain analytical solutions by 
employing ingenious methods. 

7.2 Case Study E [Cont.] ( World Population) 

In Section 5.5 of Chapter 5 we formulated three models (Models E.4 to E.6) 
involving O.D.E. formulations. 

Model E.4 

The model is given by 

AT(f) = aN(f) (43) 
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with N(f) representing total population at time t and 

with f> and J being the birth and dea,h ra.es respectively. Let N. represent the 

i " 1 atlcKd Hs. the equation is linear and the solution gtven by 

N(r)= N 0 e'“ |45) 

As in the ease of Model E l. the solution either increase or decreases 
monotonically with r, or else remains constant depending on 

Model E. 6 


Consider the model given by the following coupled non-linear O.D.E 
formulation: 


N' m (t) = - d m N m (t) + ct s fN m (t)Njt) 

TV'/f) = - <//N/U) + 0 “ . 


(46) 


To obtain the solution, we define two variables n JO and n/0 as follows. 

N m (t) = ni(t) and N f (t) = n 2 f (t) (47 ^ 

On substituting (47) into (46) we have 

n JO = - d m njt) + ooi/(0 


= -d,n,(t) + (\ -«)«J0 


(48) 


which ,s a system of linear OM* Thisen U 

lN«e?hlud th“bo« procedure we transformed non-linear equations into 
linear equations by a suitable transformation.) 


7.3 Approximate Analytical Methods 

It is often either impossible, or extremely difficult, to obtain an exact analytical 
oluUon However, it is sometimes possible to obtain an approximate 
analytical solution which is close to the true solution in some 
approximate analytical solution is obtained by solv g , of P t he 

formulation or a series of simpler formulations to >mP ro « ‘he awuracy 
approximate analytical solution. In this per- 

developed - e.g. asymptotic expansion, regu P ( pertur bation 

turbation, to name a few. We illustrate the ^ oHhe regula pe tu Caje 
technique to obtain an approximate solution by considering 

Study D. 
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7.3.1 Regular Perturbation Technique 

This technique is applicable if the formulation has a small parameter. Consider 
an O.D.E. formulation of the form 

x'(r) = F(x(f),e) (49) 

where 0 < t « 1. The solution x(f) depends on e and can be expressed in the 
form 

x(r) = x(f,£)= £ X(t)E‘ (50) 

( = 0 

Since £ is small, by ignoring terms with i > I, we obtain an approximate 
solution xjr) given by 

xjf) = X AJOfi 1 (51) 

i = 0 

The error between x(f) and x a (f) is 0(e‘) and hence by choosing / properly the 
error can be made as small as desired. Note that we obtain a family of 
approximate solutions by varying /. 

The functions AJf), 0, are obtained by substituting (50) into (49); 
carrying out a power series expansion in e and solving the equations obtained 
from setting the coefficient of e‘, i 3* 0, to zero. 

If £ appears as the coefficient of the derivative - i.e. the formulation is of the 
form 


£x',(r) = F,(x l (r), x 2 (r)) 
x' 2 (t) = F l (x l (t), x 2 (t)) 


(52) 


then the above technique fails, and in this case we need to use singular 
perturbation techniques. Formulations of the type indicated in (52) arise when 
there are several widely separated time scales in the system. 


7.4 Case Study D [Cont.] ( Rocket Launch) 

In Section 4.2 we showed that, by using grouping 3, the model for the rocket 
launch is given by 

/( t) = d 2 y(T)/dT 2 = - 1/(1 + £y(r)) 2 (53) 

with 


>■(0) = 0 and y'(0) = 1 (54) 

and the variables y(r) and t related to the original variables x(r) and t as 
indicated in (10). If V 2 «g 0 R then e«l, in which case we can obtain 
approximate analytical solutions using the regular perturbation technique. As 
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such, we assume the solution to be of the form 


y(r) = y(*. £ ) = ^ 

and substituting in (53) yields, 

£ e'd 2 y,(T)/dt 2 = - {1 + £ £ q e'y.M} *• ^ 

On expanding .he” rigb.-h.nd aide in a power series and eqnai.ng .he 
coefficients of e° we have 


d 2 y 0 (T)/dT 2 = - 1 

y o (0) = 0 y'oW = 1 

This yields the first approximate solution y a (i) given by 

y 0 (T) = y 0 ( T ) = t — t 2 / 2 


(57) 


(58) 


The error between y a (t) and yW*><*«)• . (f) By equating the 

To obtain a more accurate solution we evaluate y iW- y 
coefficients of £ l in (56) we have 


d 2 y 1 (T)/dt 2 = - 2y 0 ( T ) 

y,(0) = y't(°) = 0 

Using y 0 (t) from (58) in (59) we have 

y,(t) = {t 3 /6 — t 4 /24 }. 


(59) 

(60) 



Figure 6.1 Exact and approximate solutions. Case Study D. 
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The second approximate solution y a (r) given by 

y a (r) = r — r 2 /2 + e{t 3 /6- t 4 /24} (61) 

is accurate to 0(£ 2 ). By evaluating y,(t), i Ss 2, the accuracy of the approximate 
solution can be improved. 

Plots of y a (r) obtained by (58) and (61) are shown in Figure 6.1 for £ = 0.5. 


' 7.5 Computational Methods 

As indicated in Section 5.2 of Chapter 5, an nth-order O.D.E. formulation can 
be reduced to a vector first-order O.D.E. formulation. Hence, without loss of 
generality we can confine our attention to solving vector first-order equation 
formulations. Let the equation be given by 

X'(t) = F{X(t)) (62) 

Solutions for this on a digital computer can only be obtained for discrete 
values of r. Let represent the computed solution for t = t, and -Y(r,) the true 
exact solution. 

A variety of methods are available for computing X t . These can be divided 
into: 

(i) single-step methods - if X t is obtained using only X, _and 

(ii) multi-step methods - if X t is obtained using X t . k -\ 1 < k < K with K > 1. 

Single-step methods are self-starting, whilst multi-step methods require some 
single-step method to initiate. 

Each of the above two methods can be further subdivided into explicit 
methods and implicit methods. Explicit methods do not involve F(2f,) to 
obtain X h while implicit methods do. As a result, for implicit methods an 
iterative scheme is needed to solve a system of non-linear algebraic equations 
to obtain X t if the function F(AT) is non-linear. The advantage of implicit 
methods over explicit methods is that they are more stable, but they require 
greater computation per step. 

The step size /i, = t t — f,_, plays an important role. The error between the 
computed solution and the exact solution is a function of the step sizes used 
and the order of the method. For a pth-order method with fixed step size h, the 
error ^0(h p * 1 ). 

If the equation to be solved is “stiff”, i.e. some components of X(t) change 
slowly while others change rapidly, then one needs to use highly stable 
methods. Again a variety of methods have been developed, some of which 
require not only the evaluation of the function F(X) but also the Jacobian J(X ) 
(= dF(X)/6X) at each step. 

Most mathematical software packages contain a variety of subroutines for 
solving initial value O.D.E. formulations by different methods. Many have 
automatic step size control. This allows for the step size to be varied so as to 
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ensure rhaUhe local error lei,her wh"n 

«ro. or a eo.ponen, o, lire so,u,ion 

reaches some specified value. 


7.6 Case Study D [Cont.] (Rocket Launch) 

equation by u2ng sub-routin^D 02 ^GF^omJheN^^p^kage.^he^eason 

negative, indicating that the peak altitude 
has been reached. d in order to use the subroutine this 

need^to be^riltcn'as'a'system o, frrs,'-order e q ua,,o„s. Towards this end. we 

define x ( and x 2 as follows. 


x,(t) = y(T), x 2 (r) = y(T) 

Then (53) can be re-written as 


x',(t) = x 2 (t) 

x' 2 ( T ) = - 1/(1 +ex(t)) 2 


(63) 

(64) 




7 7 Solution of O.D.E. (Boundary Value Formulations) 

general t.p.b.v.f. is given by 

x' — /(x, u, t) to^t^tf 

where x is a .-dimensional v«c,or. MOJJP-W 
conditions are specified in terms of n, functions U) 
functions (g 2 = 0) at f - t f , i.e. 

9l (x(r 0 Mo) = ° (”• dimensional) 

gMtf),t f ) = 0 (n — «i dimensional) 

„ is assumed rha, .be boundary condrrions ensure a unique solution. j 

specified value. 
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Figurl 6.2 Plot of vs r for difference 


7.7.1 Methods of Solution 

Analytical solutions are obtainable only for a small class of formulations (e.g. 
linear equation formulation). A variety of methods are available for solving 
t.p.b.v.f. The two most important ones arc: 

(i) Transformation method : in this method the t.p.b.v.f. is transformed into an 
initial value formulation - e.g. Riccati transformation for solving linear 
t.p.b.v.f. 

(ii) Greens function method : in this method the t.p.b.v.f. is transformed into an 
integral equation formulation. 

In general, a computational procedure is needed to obtain the solution. Two 
methods for obtaining the solution are the following: 

(i) Shooting method: here the missing (n — n ,) conditions at 1 0 are guessed, and 
the equation solved as an initial value formulation. In general, the 
computed values at t f will not satisfy the condition g 2 (*(//), //) = 0. An 
iterative scheme is used where the initial conditions are altered during each 
iteration so as to yield the desired solution. 

(ii) Quasi-linearization method: in this method the non-linear t.p.b.v.f. is solved 
iteratively as a sequence of linear t.p.b.v.f. 

Again most mathematical software packages contain a variety of subroutines 
for solving t.p.b.v.f. by different methods. 
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8 Partial Difference Elation (P.D.E.) Formulations 

formulations. 


8.1 Analytical Methods 


8.1 /inuiyii*-ui -- . . 

Only fo, a small class of formulations is it P^ h “ “JJ™, 

solutions. One such is a linear equation mmu produce analytic 

B.C.S. A variety of techniques have tan^ integra , 

transform 'methcalfs^arati'on of variables and Gita's function method. 

8.2 Case Study F [Cont.] (River Pollution) 

In Section 7.5 of Chapter 5 we formulated a model (Model F. II tor studying 
the dispersion of pollution in a river. 

Model F. 1 . .,« on j Vipnpp u — 0 As a result, the 

Assume that the velocity of flow is negligible and hence V 

model for pollution concentration is given by 

d>T(x, t)/dx 2 = (l/D)5T(x, t)/dt (67) 

with the B.C. and l.C. given by 

B.C. T(-L lt t)=0 = T(L 2 
1C T(x,0)=G(x) 

We shall use the method of separation of variables. The method assumes a 
solution of the form 


2 ,l), 0sSt<ooj 


( 68 ) 


T(x,0 = W(x)K(t) 


Substituting this in (67) yields 


1 dK(0 .r (r y d2VV W 


implying 


1 d 2 W(x)_1 dK(Q 


= - A 2 


(70) 


(71) 


W{x) dx 2 DVU) dt 

As a result W(x) is given by 

H/ (X ) = ^ (C^sin A(x + L { ) + D x cos/(x + L { )} 


(72) 
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and K(0 is given by 

K(0 = e' D2J ' (73) 

Since the B.C. have to be satisfied, we have D x = 0 and /. must satisfy 

A(L, + L 2 ) = mr. n = 0,1,2... 
or 


k = mi/iLx+Lz)', n = 0,1,2... 
As a result, the solution is given by 


* . fn7i(x + L,)) 

T(x,f)= I x n e- ' sin < - v l ’ \ 

n = 0 ( + L 2 J 


with 


y = Dn 2 n 2 /(L { + L 2 ) 2 
and x„ to be selected to satisfy the l.C. From (76) we have 


* . (mr(x + L,)] 

T(.r.0l-I o a. s ,n|— 


and for this to equal G(x) we must have 


['■> . f nn(x + L.)) . 

H- t CWs,n {(r Tnir 


(74) 

(75) 


(76) 

(77) 

(78) 

(79) 


Thus, the analytical solution is given by (76) with a„ evaluated from (79). 


8.3 Approximate Analytical Methods 

As with O.D.E. formulations, approximate solutions can often be obtained by 
exploiting special features of the formulation, in a manner similar to O.D.E. 
formulations. 


8.4 Computational Methods 

We shall confine our discussion to formulations with two independent 
variables. In this case there are two different approaches that we can use to 
obtain a computer solution. In the first approach all the partial derivatives are 
replaced by appropriate finite differences. As a result the formulation is 
reduced to a two-dimensional difference equation which relates the dependent 
variables for discrete values of the two independent variables. In the second 
approach only the partial derivatives with respect to one independent variable 
are replaced by finite differences. As a consequence the formulation is reduced 
to a system of O.D.E.s which can be solved by methods discussed in Section 
7.5. The step size used in the discretization of the independent variable affects 
the accuracy as well as the stability of the computational scheme. Various 
schemes have been developed for solving specific types of P.D.E. formulations. 
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8.5 Case Study E [Cout.J t World Populations, 


Model E. 8 

The model is given by 

dN(x,t) dNjx.t) 
dt + Sx 


-d (x)N(x.t) 


with 


and the l.C. by 


N(0,f) = 



b(x)N(xJ)dx 


N(x. 0) = G(x) 

(Note that the domain for N(x,t) is given by 0 < x < x„ and t > 0.) 


(80) 


(81) 

(82) 


Let f„ j = 0,1,2..., be a seque "“ jiJj^quMitityl^R eplacing 8 the partial 
with r.-l* and h being » jXard difference, i.e. 

derivative with respect to f by a first-order DacK 

8N(x , tj) _ N(x, t,)— N(x,t|-i) (83) 

and using Ihc noia.ion N W for NM TO 8** ^ 10 ’ S ’' SKm 
O.D.E.s given by 

dN,«/d*+to/*)+dwiNW -dm- .<*> 

and C* m ( oc\ 

N,(0)= b(x)N j(x)dx 
Jo 

f0 Vtos y t m of ODE. equations can be solved recursively as 

N 0 (x) = N(x,0) = G(x) 

, ki / \ n v c v an iterative technique 
is known. Since N,<01 depends or,«W ’ h technique is guess 

Will be needed to obt^n N.W g ivein N,- ^ ^ computed solution the 

0) and obtain N&x) by solving ( )• 8■ gueS sed value of 

integral in (85) can be is modified and the 

N (0). If the two are not the same, the gues 

procedure repeated till (85) is satisfied. 
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Notes and References 

The appeal and the use of analog and hybrid computers to solve mathematical 
formulations have diminished considerably due to developments in digital 
computers and mathematical software since 1970. As a consequence, there 
have been no recent books on analog or hybrid computers. [ 1 ] and [2] are two 
of the most recent books on these computer types. In contrast, the list of books 
on digital computers and digital computing is not only very large, but 
continues to grow at an exponential rate; [3]-[5] show a small sample. 

For more on dimensional analysis, see [6]; on simplification and scaling, see 
[7]. Item [8] deals with scale effects in animal locomotion. Books on analog 
computing contain a detailed discussion on scaling of differential equations. 

Numerical and computational techniques for solving static optimization 
models have received a lot of attention. Books on this topic vary from general 
(e.g. [9]-[l 1]) t0 those giving an in-depth study of a particular method (e.g. 
[12]) or type (e.g. [13] and [14]). Most elementary books on numerical 
analysis or methods (e.g. [15]-[l 7]) contain one or more chapters on solving 
linear and non-linear algebraic equations. 

Books on ordinary differential equations (e.g. those cited in Chapter 7) 
discuss both exact and approximate analytical methods of solutions. Item [18] 
deals with the use of similarity methods in solving, and [19] is one of many 
books which discuss solving differential equations using physical understand¬ 
ing. The number of books on approximate methods of solution is also large. 
Some of these are general (e.g. [20]) while others deal with specific techniques 
(e.g. [21] and [22]). Items [23]-[25] are a small sample of books on numerical 
methods for solving differential equations. 

Most books on partial differential equations discuss a variety of analytical 
solution techniques. See the list at the end of Chapter 5 for a short selection of 
such books. Numerical methods of solution is an active area of research and 
new schemes are constantly being developed. Books on numerical methods 
can be divided into two categories. Those belonging to the first category are of 
a general nature (e.g. [26]-[29]) while those belonging to the second deal with 
a particular method or type of formulation (e.g. [30]). 

NAG and IMSL are two fairly well-known software packages which 
contain a large number of programs for solving a variety of mathematical 
formulations. For further details see [31] and [32]. 

[1 ] Korn. G. A. and Korn, T. M.: Electronic Analog and Hybrid Computers , 2nd edn. McGraw- 
Hill. New York, 1972. 

12] Bekey, G. A. and Karplus, W. J.: Hybrid Computation. John Wiley. New York, 1968. 

[3] Davis, G. B.: Introduction to Computers , 3rd edn. McGraw-Hill, New York, 1977. 

NJ Barlec, T. C.: Digital Computer Fundamentals , 5th edn. McGraw-Hill, New York, 1981. 

L-] Goldschlager, L. and Lister. A.: Computer Science: A Modern Introduction. Prentice Hall, 
Englewood Cliffs. NJ. 1982. 

[6] Taylor. E. S.: Dimensional Analysis for Engineers. Oxford University Press, Oxford, 1974. 

I J Lin. C. C. and Segel, L. A.: Mathematics Applied to Deterministic Problems in the Natural 
Sciences. Macmillan, New York, 1974. 
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CHAPTER 7 


Mathematical 
Formulations - II 


1 Introduction 

In Chapter 5 we discussed mathematical formulations suitable for modelling 
deterministic systems. When uncertainty is a significant feature of the system, 
the system characterization must be done in a stochastic framework. As a 
consequence the mathematical formulations which can be used as dummies in 
the modelling must allow the variables to change in an unpredictable fashion, 
so that they adequately model the uncertainty of the system when the 
formulation becomes a model. Such formulations belong to the theory of 
probability and stochastic processes. 

In this chapter we discuss some of the mathematical formulations from the 
theory of probability and stochastic processes that are suitable for use as 
dummies in modelling systems that involve uncertainties. We commence with 
a brief discussion of random variables and general stochastic processes. 
Following this, we discuss the different types of mathematical formulations 
that are available for modelling systems with uncertainty. Later, we examine 
some of these formulations individually. For each formulation we give a brief 
introduction to the general structure, discuss some important features and 
finally illustrate their use as dummies in modelling one or more of the case 
studies discussed in Chapter 2 using the system characterization done in 
Chapter 3. 

2 Random Variable and Stochastic Process 

A random variable X(io) can be viewed as a function which assigns a number 
to every outcome a> of an experiment. The domain of A'(co), ft, is the set of all 
experimental outcomes (a>) and range, R, the set of real numbers. Since the 
outcome of an experiment is uncertain, the value assumed by X(d) is also 
uncertain. 

A rigorous approach to characterizing X((o) would commence with an 
axiomatic characterization of ft; the probability measures on sets of outcomes 









112 


mathematical modelling 


belonging to Q and X(o» defined as a set function. An alternative characteris¬ 
ation of XM is through a distribution function F(x,0) with 
F(x,0) = prob{X(cu) <x}, — oo < x < 

a " d + XMis 

characterized by a .-dimensional distribution function of the fottn 
Fix,.*,.*,.x„;fl| - prob{X,M«x.X» < x.) 

.there X,. X,... X, ate the components ofXhe vector XM- v „ iable 

A stochastic process can be viewed « “ variables , and to and 

-i— - -j- in aD 

iSlfmSttte or infiihteland -responds, urne 

ras • -£*z 

'■«— 

Le. it is a function of the mdependem vana e t aromatically 

A general stochastic proceaXW >» «“ a ' a " n , dis , ri bution functions. 

^hr^gh^n a^ap^oac^vmgy ^ ^ The „ the 

distribution function 

F(x 1 ,r 1 ;x ! ,t ! ;.-;x..t.)»P'ob!X(t I )«*„X(l ! )«x 1 .X(,.)<«., < 

charac,. rises Xft.«.) torn valuertoft ;« 

2.1 Markov Property 

A stochastic process X(t) is said to have the Markov property ,f. for 

,,<r ! <r 1 .<r..,<r..(t,ar.lSf«n),weha.e 

prob{X(t.X*.|X(t,) = x,.X(l.-,) = x.-il (4, 

= prob{X(t„)^x (l |X(t ll -i)= s x (l - l } 1 

In this case the ,,-dimensional distribution funetions characterizing 


representing an outcome of X[t). x t -x m arc determine m ^ 

XlOat r = r,^i.'« 
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{X(t l ),XU 2 )y.">X(t n )} for n> 2, can be written as a product of two- 
dimensional conditional distribution functions of the form F(x h f,|x, _ x , f,_ x ). 
Thus the process is completely characterized by specifying the two- 
dimensional joint distribution function F(x h f,;x,, tj) for all f,, r^eT. 

Stochastic processes having the Markov property are called Markov 
processes; otherwise they are called non-Markov processes. 


3 Types of Formulations 

Broadly speaking, the mathematical formulations suited for modelling 
uncertain systems can be divided into two categories: 

(1) static formulations 

(2) dynamic formulations. 

3.1 Static Formulations 

Static formulations are mathematical formulations involving algebraic 
equations or functions with one or more of the variables being random. The 
variables can be either scalars or vectors, either discretely or continuously 
valued and either constrained or unconstrained. When the formulation 
becomes a mathematical model, there is a one-to-one correspondence between 
variables of the formulation and the physical variables of the system 
characterization. Such formulations are suited for modelling uncertain static 
systems. 


3.2 Dynamic Formulations 

Dynamic formulations are mathematical formulations involving stochastic 
processes with the independent variable ( representing time when the 
formulation is used to model uncertain dynamic systems. The variables are of 
the form X (f, co) and, as discussed earlier, the formulations can be divided into 
two types Markov or non-Markov - based on whether the process X(t,a>) 
has the Markov property or not. 

Each of the above two types can be divided into four subtypes based on: 


(1) the values assumed by the process X(t) (=X(f,o>)) being discrete or 
continuous; and 

(2) the values assumed by the independent variable t being discrete or 
continuous, 

an d is as shown below: 
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f 


Discrete 

Continuous 

— 



D, 

D a 



Discrete 




X 

Continuous 

D, 

D, 



D1: Discrete State/Discrf.te Time Formulations 

* ,. 

r 2 ,... which form an increasing sequc ™l c . . the f ormu lation is called a 

If the process is Markovian an r * ■ ^ t h en t h e formulation is 

Markov chain. If the process is Markovian and r - x, then | 

called a Markov chain with infinite states. 

D2- Discrete State/Continuous Time Formulations 

In ihi^ Xu) assumes only discrete values S|.Sj - ..- s r 3ttd I assumes a continuous 

s„. 

chain formulation. 


D3: Continuous Statf./Discretf. Time Formulations 

In this X(f) assumes a continuous ranges of vahies5£d*i 

Markov sequence formulation. 


D4- Continuous State/Continuous Time Formulations 
l„ this, both XV) and , assume a 

later section. 


3.3 Stationary vs Non-stationary Process U 

A process X(t) (Markovian or non-Markovian), is said to be stationary® 
distribution function is invariant under a shift in t, i.e. 
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F(x„ r,; x 2 , r 2 ;...; x„, t „) = F(x,, t\; x 2 , 1 2 ,...; x„, Q 

with t\ = f, + r(l ^ ^ n) for all r and n. 

For a Markov process, stationarity implies that the two-dimensional 
distributions function has the form 

F{x h ti;xjjj) = F(x h xj, t ) ( 6 ) 

where r = — f, (we have assumed that tj > ti). 

3.4 Second-order Characterization 

A second-order characterization involves only the first and second moments of 
X(t ), and is given by the following two functions: 

m(t)=E[X(t)l(teT) (7) 

K(r,n = £[{X(r)-m(r)}{X(n-m(0}],U,r'6D (8) 

If the process is stationary, then m(t) = m and R(tj')= R(t — t'), i.e. the first 
moment is a constant and the second moment is a function of only one 
variable, t = t — t'. 

In general, a second-order characterization contains less information about 
X'(r) than a characterization involving the distribution functions. However, 
when X(r) is Gaussian, the two characterizations are equivalent. 

4 Static Formulations 

As mentioned earlier, static formulations involve algebraic equations with one 
or more of the variables being random. In this section we discuss two such 
formulations. 


4.1 Analysis of Variance Formulation 

Such formulations are used extensively in many disciplines for modelling 
experimental situations involving one or more factors which have some effect 
on the observations. The formulations relate the observed values to the effect 
of these factors. The effects are two types: 

(i) assignable effects which result from recognizable conditions, and 

(ii) unassignable effects which result from unrecognizable and random 
conditions. 

The model formulation is of the form: 


observed value = parameters representing assignable effects} 
+ Z[random variables representing assignable effects} + XJrandom 
variables represent unassignable (residual) effects} 
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We shall illustrate the use of such formulations in solving proble 
considering Case Study B. 

4.2 Case Study B [Cont.J (Alloy Selection) 

• thic nroblem was done in Chapter 3, 

manuTactur^UeuTspccimens tor samples) be obtained to, each combtnat.on 

of ore and method of manufacture. obtained, can be viewed 

The first factor, viz. the mine from which. the on ms »btam ^ ^ ^ 

as a four-level factor, where the term kM ^ . corres ponds to MA, (i.e. 

property defining each categ > f second factor, viz. method of 

me * 

formulations. 


Model B.l J 

Using the notation of the system 

' Assume 'thluheehec, of,he two factors on the thermai coemcen, TC,» 
to be of the form 

TC IJk = A + *, + Cj + (RC)ij + X ijk 

:„ h d mcl h U Vrj.'£— asU^'"i 

£ 15: in , due to 

i^^fnbetweenUwith.evefo, 

' J ' first factor and the jth level of second farton efTec ts. 

X iJk : residual term representing the unassignable (and ranao 

Because R t , C, and (*Q,; represent average deviations from the ovel 4 
average A, we have the following set of constraints. j 

LK«=o, fc,-0 

i=l J =1 
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4 


X (RC)ij = 0 
1= 1 

\<j^2 

(10) 

i (*o,v=o 

1 i s? 4 

(ID 


j= 1 


(Note: If we ignore the term ( RC ) j; - in (9) we are ignoring the interaction 
between the efTects of the two factors, and in this case we have a simpler model 
which is called the “additive" model.) 

We assume that these residual (random) terms are characterized as follows: 

(i) E[X iJk ] = 0 for all /, j and /c, 

(ii) X ijk are mutually independent, 

(iii) var[X l7 *] = o 2 for all /, j and k 

(iv) X ijk is normally distributed. 

To decide whether a particular combination of method and mine ore has any 
significant influence on the thermal coefficient of the alloy can be done by 
testing the null hypothesis that (KC),, = 0 for all / and j against an alternative 
hypothesis that at least one of them is different from zero. Similarly, we can 
decide whether the method of manufacture has significant influence or not, 
and so on. 


4.3 Regression Formulations 

The typical form for such formulations is given by an equation 

Y = ( nx l% x 29 ...x„a)+v ( 12 ) 

where X f (l ^ ^ n) are known variables, V an unknown random variable and 
Y another known variable. The structure of the formulation is given by the 
function </, and 0 is the set of model parameters. 

Such formulations are useful for modelling uncertain systems with “black¬ 
box" characterization, with X t {\ < / ^ n) representing the system input 
variables, Y the system output variable and V representing uncertainty in the 
system. 

Without loss of generality we can assume that the mean of V is zero. As a 
part of specifying the formulation one needs to specify the distribution 
function F(v) for the random variable V. 

Regression formulations are often used for modelling uncertain dynamic 
systems by letting one of the known variables X h 1 ^ i ^ w, to represent the 
independent variable time. An alternative approach is by indexing the 

variables and the indexing linked to time. In other words, the formulation is of 
the form 


Yt = 0iX lh X 2h ...X mh 0)+V t 


(13) 
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with V, representing I'd) a "‘ ‘ k terete time instant I, only. Thus 
relationship involves variables at a 8 involve past values of 

there is no memory effect as the formula'.on "° a , e , ha „ one lime 

the variables. If the relatiims ip i oWe vari.bl.s ^ ^ 

anff non- 

,,„rrr rr;;,Ti"r»r:rea,,, - - • «- 

regression formulation: if not. the formulation IS non-hnear. 

4 4 Case Study 1 [Com.] {Speed of Aeroplanes) 

The system charac.eHra.ion ,'lm^Sof 

S«,ion 6.10. and the prohtem s.o obm.n f , of models 

involvingmgressior^hirmhlationaW^eemploy the notation used m^m: system 

“Sie !t SS 5 --*— ™ ■ 0 

for r < 1900. 

Model J.l (Linear Regression) 

We assume ,h„ the maximum speed achieved a, time , d > 1900. - — * 
X(I) = a, + b,(l - 1900) + HO 

wheren, andb, are noC'^KaoverPnstanis^htn^a iandom^pro«ssJiithzera 

m ^The atKjve'fbrmiilation implies that thejwaxitm^sp^.achieved^coiisists 

since V { in (13) corresponds to ^(f) lor 

,he above formulation the determimam: comprnmm 

ns"*—*— are ^ 

below. 

Model J.2 (Non-linear Regression) Vl 

Model J.2(a) M5) 

X(,).a 1 e-“-'””+PW j 

where n, and hr are non-negative constants and ddt is as in Model). I •>• > “ 
model, appears non-linearl, in the formulation. 
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Model J.2( b) 

log *(0 = 03 + /> 3 (f- 1900)+ V(t) (16) 

where a 3 and fr 3 are non-negative constants and V(t) is as in Model J.l. In this 
model X(f) appears non-linearly. 

The advantage of Model J.2(b) is that by defining 

y(f) = log*(f) (17) 

(16) reduces to a linear regression formulation of form given by (14), with the 
parameters also appearing linearly. The same is not true for Model J.2(a). 


Model J.3 

This is a refinement of Model J.l, in the sense that it incorporates changes in 
technology; i.e. from piston driven to jet propulsion to rocket. More 
specifically, we assume the following: 

(a) For the period (1900-1940) the speed is given by 

X(f) = c l +rf l (r-1900)+K(r) (18) 

with two parameters c x and d x . This period corresponds to piston-driven 
engines. 

(b) For the period (1940-1980) the speed is given by 

XU) = c 2 + d 2 (t - 1940) + V(t) (19) 

with two parameters c 2 and d 2 , and d 2 > d x indicating an increase in the 
rate of change due to jet engine technology replacing the earlier piston 
engine technology. 

Note that c 2 is to be selected such that (18) and (19) yield the same value 
for t = 1940. We assume that d 2 = 5*^ and later check the validity of the 
assumption. 

(c) For the period (1980-2020) the speed is given by 

X(t) = c 3 + d 3 U - 1980) + V(t) (20) 

with two parameters c 3 and d 3 , and d 2 > d 2 indicating a faster rate of 
change due to rocket engine technology replacing the jet engine 
technology. 

c 3 is to be selected such that (19) and (20) yield the same value for 
t = 1980 and we assume that d 3 = 3 d 2 (or equivalently = 15^). 

In other words, in this model the total time interval is divided into three 
regions, each of 40 years duration, and that the rate of growth is separate for 
each period because of different technologies involved. In (18)—(20) V(t) is as 
indicated in Model J.l. 
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5 Discrete State/Discrete Time Formulations 

Le, .he discrete vaiues ,or scares, assumed by XW be deemed 

with r either finite or infinite. Similarly, \e u 0 ,. i. 2 being an 

We also use the notation X(f) and X, interchangea y. 


5.1 Markov Formulations 


The Markov property was discussed in Section 2.1. Formulations with this 
property imply 


prob {X, +1 = SijlX, = Sj . X,. k = s„...,X 0 — s m } 

= prob {X, + 1 = s k |X, = Sj}, 1 *]Xl,m*r. (21) 


As a result, for each time r (f = 0,1,2...) the formulation is completely specified 
by an r x r one-step transition matrix P(t). wit 


P(0 = 


Prl«) 


P.rlOl 

PJt) J 


( 22 ) 


“ ,here p«b(X,. 1 -h,l*.-»i). (23) 

and subject to the following constraints. 


(i) Pj k (t)>0,l<j,k^r 

(ii) t 

k= 1 


, r „„ does no, change with r. i.e. TO - T for all r. then the formulation is 

stationary in the sense described earlier. , y (in a 

The one-step transition matrix characterizes the * 

characterization. This is done by an r-dimens.onal initial vector p( ), 

p(0) = [ Pl (0).p 2 (0),...p,(0)] r (25 1 

W ' th Pi(0) = prob (X 0 = Si}. l<*^ r ^ 

and r .yj. 

P.<0) ^ 0,1 < i < r; I P,<0) = 1 

The evolution of X(t) is given by a r-dimensional vector p(0, 

p(t) = [Pi(0.P2(t),--;Pr(t)]. 


(28) 
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with 

Pi(/) = prob {X, = s,}. (29) 

It is easily seen that p(t) satisfies the linear difference equation 

j*f+l)=P T (/)p(0 (30) 

with p{ 0) as specified above. 

5.2 Non-Markov Formulation 

In this case, X, no longer satisfies the condition given by (21). Let 
P {X | + 1 = S p \ X t — Sj % ..., x t _ ^ == Sj,... X | s m } 

= {^r + 1 = 5 pl^f = s jy • •iXf^k — 5 /} (31) 

where 1 ^ k y j , /, m, p ^ r. Note that the state at time (f + I) depends not only on 
the state at time t but also on the states at times r - n, 1 ^ n ^ k. 

By enlarging the state space suitably, the above non-Markovian r state 
formulation can be transformed into a Markovian formulation with r* states. 
We illustrate this by the following example: 


Example 1 

Let r = 2 and k = 2, i.e.: 

prob ( X, + j = s k \X, = Sy, X,- t — s m% ... X , = s , } — prob { X, + j = s*I X, = Sj. X ,_ j = s m ,. 

(32) 


Define a random variable Y, as follows. 

y,= 1 i(X, = s i% X t », = Sj, 

— 2 if X j — s |, X j _ | - s 2 , 

— 3 if X/ = ,^ 2 . X |_ j = S| i and 

= 4 if X t = S 2 , X f _ j — s 2 • 

Then, from (32) we have 

prob [ V,*, =*l Y, = j. Y,. x = n,... T, = m\ = prob { y, + l =/c| T, = 7 } (33) 

implying that T, is a Markov chain. Hence, we can carry out the analysis of the original non- 
Markov process [A” ( ] by analysing the transformed Markov process J Y, J. 


5.3 Case Study G [Cont.] (Weather Changes) 

The system characterization for this problem was done in Chapter 3, Section 
6.7. The variable of interest is /V(r), which characterizes the weather on day t. As 
indicated in the system characterization, N(t) changes randomly and assumes 
the following values: 

N(t) = 1 if day t is sunny, 

= 2 if day 1 is partly sunny, 

= 3 if day t is cloudy, 

i c. the states of N(t) are given by the set of integers {1,2,3}. 
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We assume that N(t) has the Markov property and that the Process is 

subjecU^wmstra^ts^ve^by^^^he^tial state, viz. N 0 . is assumed to be 
specified. 

Model G.2 

We assume that N(t) has the Markov property but is not stationary^ ln 

sssisiiiss 

constraints. 

Model G. 3 

a^non-Markovian formulation which can be iransformed into a model hav,n 8 
the Markov property with an enlarged state space. 

5 4 Special Case (Random Walk Formulation) J 

The random walk formulation is a special case of the Markov formulation 
discussed in Section 5.1. Here the state space .s^he^t ^ 

T + 7o" - l!or have no change In theltate. As a result the one-step transition 
matrix is a tri-diagonal infinite-dimensional matrix. Such formulations 
been used extensively in modelling a range of physical systems. 

6 Discrete State/Continuous Time Formulations 

A fairly general stochastic process formulation with discrete state and 
continuous time is the semi-Markov process. We start this section with jbne 
introduction to this process. Following this - ^u« a fcw ^h 
formulations - examining their structure and illustrating their 
ling some of the case studies discussed in Chapter 2. 
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6.1 Semi-Markov Process Formulations 

As in Section 5, the discrete values (or states) assumed by X(t) will be denoted 
by s l ,s 2 ,...s„ with r being either finite or infinite. Once the process enters state 
s, (1 < i < r) the transition to next state s ; (1 ^ r.j # /) is uncertain and given 
by probability q (J . These probabilities are constrained as follows: 

% > 0. q u = 0 for 1 ^ ij r 

and 

X <?.;=• (34) 

j= 1 

Thus the transition to next state is completely characterized by an r x r matrix 
Q with elements q (J as defined above. The duration for which the process stays 
in state s„ once it enters it, is a random variable characterized by a distribution 
function which depends not only on the state s, but also on the state s ; - to which 
the process moves. Let r y (1 <i,y's$r) represent the non-negative random 
duration for which the process stays in s, conditioned on the transition being 
to Sj (I < j < r,j ^ i) and let G y (r) be the associated distribution function. Thus, 
the semi-Markov process formulation is characterized by an r x r transition 
matrix Q and r x (r — 1) distribution functions 

Gy(0{ 1 ^i.j^r.i #y'}. 

If the initial state X(f 0 ) is not known exactly, then a probabilistic 
characterization of it needs to be specified. This is done by an r-dimensional 
vector p(/ 0 ) of the form given by (25) subject to constraints as given in (27), with 
p,(/ 0 ) representing the probability of A"(r 0 ) = s„ 1 < i r. 

A special case of this is when all the distribution functions GJt) 
{1 ^ i, j ^ r, i # j} are exponentially distributed, i.e.: 

G t j(t)= 1 -exp(—0 y f) (35) 

then the process X(f) has the Markov property and in this case the formulation 
is called a continuous time Markov chain formulation. 

6.2 Some Specific Formulations 

A variety of discrete state/continuous time stochastic formulations have been 
developed for modelling real systems. Some of these are special cases of the 
general formulation discussed in the previous subsection. We shall discuss a 
few of these formulations to highlight their role in modelling specific types of 
real systems. The formulations that we will discuss are the following. 

(1) point process formulation 

(2) queueing process formulation 
( ) renewal process formulation. 
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6 3 Point Process Formulations 

Point process for—ions M ^ *“ 

of events^an bTvicwcd as points dis.ribu.ed randomly along 

the time axis. formulations can be done in three 

d,^™ w «arereC Cch o.her. I. could be done in .erms of 

either 

(i| the dis.ribu.ion funcion for .he number of events. MO. in an in.erval 
,ii) the^intensity function A<„. w„h W* being the probability of an even. 

Depending on ,he si.ua,,on one of.he above ,hree m.gh, be beuer sm.ed tor 

modelling than the others. , ation j s the stationary Poisson 

The best-known pom, process 'formula on is in ^ ^ ^ ^ 

distributed 1 according ".o'an exponemial distribrnio. with parameter (1/4 

Some extensions lo .he stationary point process ar 
« non-s,a,ionary Poison process - characterized by the intensity function 

„ doubly stochastic Poisson process - characterized by X being a random 
variable (or A(r) being a stochastic process). 

Morecomplica, ed pointprocesses have At., asafune,ion of,he timehis,or,of 

’’“w.'stahremrnmPoisson process formulattons in a later section. 


6.4 Queueing Process Formulations 


tnajueing process for mu, 

“customers" either waiting, or be g Datients waiting to see a doctor 

and service can be planes integer »*«• 

wifh^tO^^implyit^that^ere is no customer being served or waiting to be 
served at time t. • nr leaves If the customer arrival 

- ra " d °- ,hen X(,) chan! “ 
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randomly over time. The nature of change in X(r) depends not only on the 
arrival pattern and the service times, but also on the number of servers (e.g. in 
the case of planes waiting to take off, it would be the number of separate 
runways) and the policy (e.g. first come, first served) used for servicing the 
customers. 

Let F be the distribution function for the inter-arrival times, H be the 
distribution function for the service times and s the number of servers. These 
three variables characterize the queueing process formulation completely. A 
compact notation indicating this characterization is through a string of 
three symbols A/B/C. A represents the form of F. If it is exponential then the 
symbol M is used; if Erlangian then the symbol E is used; and if the distribution 
is general then the symbol G is used. Similarly, B represents the form of H. C 
represents the number of servers. Thus, M/M/s indicates a queueing process 
with F and H exponentially distributed and s servers. X(f) has the Markov 
property only for the M/M/s queue. 

6.5 Case Study K [Cont.] (Supermarket Operation) 

The system characterization for this problem was done in Chapter 3, Section 
6.11. The variable of interest is /V(0, the number of people either in the queue or 
being served at time t. A suitable formulation to use as a dummy in the 
modelling of this problem is the queueing process formulation discussed in 
Section 6.4. In this section we develop two models using this approach. 


Model K.l (Markov Formulation) 

We assume the following: 

(i) The inter-arrival times are statistically independent and identical and 
from an exponential distribution with parameter A. 

(ii) The service times are statistically independent and identical and from an 
exponential distribution with parameter p. 

(iii) The number of servers is s. 

(iv) Arriving customers join a single queue, and as soon as a server becomes 
free the server takes the customer at the head of the queue for service. 

Let N(t) represent the number of customers either in the queue or being 
serviced, and p„(t) be the probability that N(t) = n y n = 0,1,2 — Then we have 

I P„(0 = 1 06) 

n = 0 

Let (= 0 correspond to the time the supermarket opens in the morning. We 
assume that p„(0) = 0 for n^O and p o (0)= 1, implying that there is no 
customer when the supermarket opens. Whenever N(t) < s, one or more of the 
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servers is idle. The number of idle servers a. time. 

equals s - N(f) if N(t) < s. All the servers are .die when N(t) u. 


Model K.2 (Non-Markov Formulation) 

tions P in this model. More specifically we assume the following. 

(i) At t = 0 there are no c V s *°*"* r *j and h subsequent customers 

(ii) The first customer arrives at time X, and .j! ^ 

arrive with inter-arrival times being X*X» ^ ^ ^ and so 

orT'i^T'are 'independent and identically distributed random 

m »—* w.v- 

independent and identically distributed random variables with 

(iv) Thema^fsTrvers (numbered l through s), and as result there are . 
separate queues. 

,_a Thp arriving customer joins the shortest queue. 

(v) 1 he arriving cumuiu j customer oins a queue no 

(Vi) S.Thistsius,ir^ when there are physical barriers which 

prevent customers changing queues. 

. , , w /a i c i < « renresents the queue length associated with server i at 
“ S"o,„d busy otherwise AH,me, the.o,a, dumber 

of people either waiting or being served is N(t), given by. 

N(t) = t fl 

i = 1 

A slightly more refined formulation is one where customm can chanp 
8 a cimnle case is one where a customer moves to the adjacent queue 
?he adjacent queue length becomes smaller than the length of the queue in 
front of the customer. 

6 6 Renewal Process Formulations 

In a renewal process formulation the time interval between events »• 
seuuence of independent and identically distributed random aanab '“” 
distribution fJL ft Such a formulation is 

the operation of unreliable components where an «ve nt coupon ^ 
failure and the instantaneous replacement of the failed comp J 

one. 
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6.7 Case Study / [Cont.] (Component Reliability) 

The system characterization for this problem was done in Chapter 3, 
Section 6.9. We employ the same symbols as used in the system characteriz¬ 
ation and the variable of interest is N(t) - the number of replacements of the 
element in the interval [0, t) with the element being new at t = 0. The elements 
are identical and the failure time is given by a distribution function G. 

Model 1.1 

We assume that the failure is detected the instant it occurs, and that all failed 
items are replaced by new ones instantaneously. As indicated in the system 
characterization, this is an idealization which is justified if the time to replace is 
small in comparison to the mean time to failure. By viewing a failure and 
replacement as an event, we have events occurring randomly over time. Since 
the replacement elements are identical (in a statistical sense) the time intervals 
between events correspond to failure times which form a sequence of 
independent and identically distributed random variables with distribution 
function G. This is best modelled by a renewal process formulation which takes 
the following form when applied to this problem. 

Let T be the time interval of interest and N(T ) be the number of 
replacements carried out. To find the number of replacement elements needed 
it is necessary to obtain the statistical characterization of /V(T), for example the 
distribution function or its moments. Alternatively, if there are n spares and we 
wish to find the time r that these will keep the machine operational, the 
problem reduces to finding the statistical characterization of t. 

Model 1.2 

Suppose that there are no spares and that the failed element can be repaired, 
making the machine operable once more. If we assume that the failure 
distribution of a repaired element is nearly the same as that for the new 
element, we have a repair which is called “back to new". If the time to repair is 
random, then the variable characterizing the system can be specified as 2f(f) 
with 

A^f) = 1 if element is working 

= 0 if failed (38) 

Let H represent the distribution function characterizing the time to repair, and 
G represent the distribution function for the time to failure. Then, in this case, 
*(f) can be modelled by a semi-Markov process formulation discussed in 
Section 6.1 with two states. More specifically, we have: 

(i) s x = l,s 2 = 0 

(iii e -[i j] 


(39) 

(40) 
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(iii) G, 2 (0 = f ( f ) and G 2 i(0 - H(t) ^ 

If Hit) = l lit - P\ with Uiy) = 0 for y < 0 and U(y) = 1 for y > 0, then it implies 
a constant repair time p. 

7 Continuous State/Discrete Time Formulations 

The variables of the formulation ass “ me * ^ 

characterized for discrete values o e l values off correspond to the 

- - *■« - - •••»> 

7.1 General Formulation 

A (airly general formularion is one where X, satisfies an equation 

.X,...« 

.• , P ., S the subscript corresponds to the independent variable time t. 

time, as the subscript c P n Jhe evo , ution of Xj ,j > , is uncertain 

Assume that Xj, 00 J " , w \ Note that if the random sequence 

due to the unknown random sequence { Wj). Note nan. me m 

T'he'Mquence'iX j } < ^neratedby (42, does not have the Markov property as 
X i+ i depends not only on X, but also on i -pJ > 


7.2 Markov Formulation 

If n.m and k in (42) are equal to zero, then we have 

X i+1 =</(X„u„ W,,0,) 


(43) 


ation having the Markov property by an enlarged state spac ^by 
on n.m and k. 

7.3 Time Series Formulation 

There is a da* o((o™ula.ions^hich X, ,s »un«r»tn 
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manner, i.e.: 

X t =Y t +V t (44) 

The changes in F, are given by a linear difference equation formulation of the 
form 

t*jYi-j+ £ PjWi-j (45) 

J m i J m i 

where {W,} is a sequence of independent and identically distributed Gaussian 
random variables with zero mean and variance a 2 . The parameters (a^, 1 < j < n 
and pj , 1 ^m) are constrained so that the sequence {K,} is stationary. The 

constraints which ensure this are that the roots of P(S) = 0 and Q(5) = 0 lie 
inside the unit circle, where 

m = &"- £ <*jd n ~ j and Q(6) = £ Pjd m (46) 

j=i j=\ 

The difference equation formulation for V { is called an autoregressive moving 
average (ARMA) formulation of order n and m. When m = 1 it is called an 
autoregressive (AR) formulation. 


7.4 Case Study L [Cont.] ( Demand for Soft Drinks) 

The system characterization for this problem was done in Chapter 3, Section 
6.12. The variable of interest is X n the demand for the soft drink in period t. In 
this section we build a model for the simplest characterization, where the 
system is viewed as a “black box” with one variable X r The period is a month, 
and X, represents the monthly demand. 

Model L.l 

We assume that X t consists of a deterministic component D t and a stochastic 
component V t . 

The deterministic component D t is assumed to be made up of two 
components: g t , which represents growth (due to population increase) and s n 
which represents seasonal variations (e.g. extra demand during summer 
months). We assume that 



g, = a + bt 

(47) 

and that 




s t = s t-12 

(48) 

Thus, we have 




X t — a 4- bt + s t + V\ 
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From (47) we have the following: 

(ia--0-A)ir.«* (49) 

where A is the backward shift operator; i.e. A 0 , = and 

(ii) m, = /,-/,-,=( 1-A)/,»(1-A) 2 ff, = 0 (50) 

i.e. by differencing, the efTect of growth is removed. 

From (48) we have the following: 

r, = s, — s,_ |2 = (1 — A l 2 )s, = 0 (51) 

i.e. by taking the difference with a lag of 12 (i.e. A ) the effect of seaso 

variation is removed. 

Thus, defining a sequence V, as follows: 

y ( = (l - A) 2 .(I - A I 2 )X, (52) 

. tVm of erowth and seasonal variation and are left only 

m, i.e. Y, is modelled by 

Y, = iajY,.j+ I PW-J (53) 

i= i 

Once the order n and m and the parameters are estimated using the data 
estimates of future values of Y, can be obtained from (53) by assuming W, 

this in turn yields the future values of X, trom pzj. 

Note that we need not carry out the first differencing twice. The differencing 
of (49) leaves a constant b and the effect of this is removed by the differencing of 
(51). Thus, T, defined by 

y i = ( 1 _A)(l-A , 2 )X r < 54) 

is the more appropriate sequence to be modelled by (53) as opposed to Y, 
generated by (52). 


g Continuous State/Continuous Time Formulations 

The variables of this formulation are of the form X(t^X(tM) with both the 
variable X and the independent time variable t assuming a continuous range 
of values. The formulations can be divided into three types base on t e na 
of changes in X(t): 

Type A: The time history of X(t) is a continuous function of It with* 

Type B: The time history of X(t) is a function with jumps at rando 
yP instants and in between jumps the function does not chang^ 
Type C. The time history of X(t) is not only a continuously changing 
y of l but also undergoes jumps at random time instants. 
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Thus in the case of Type B there is an underlying point process which affects 
the variable A^r), whereas in Type A, X(t) is affected by a process which 
changes continuously over time. Type C is a combination of both. 

The two well-known formulations are the Poisson process (Type B) and the 
Weiner process (Type A). A variety of continuous state/continuous time 
formulations have been developed using these as the building blocks. We give 
a brief presentation of these two basic processes and then discuss two other 
formulations. 

8.1 Poisson Process 

This type of process was briefly discussed in Section 6.3. Let N(t) represent 
the count of events, occurring randomly over time, in the interval [ 0 , r) and let 
/V(0) = 0. N(t) is a stationary Poisson process if the distribution of N(t) is given 
by a Poisson distribution function, i.e.: 

prob{/V(f) = fc} = {(/r)*e “*'}/&! (55) 

for k = 0,1,2.Two extensions of this process are: the non-stationary 

Poisson process and the doubly stochastic Poisson process. 

8.2 Weiner Process 

The Weiner process, W{t), is a continuous process which changes erratically 
and has many unusual properties, some of which are listed below: 

(i) W(t) has zero mean; i.e. E{W{t)} =0. 

(ii) W(t) has independent increments; i.e. for t { < t 2 < t 3 < t 4 

E{im 2 )-w(t l mw(u)-mt,)']}= o. 

(iii) W(t) has the Markov property. 

(iv) E{ W{t) Wit ')} = min(f, t'). 

(v) W(t ) has a Gaussian distribution. 

(vi) W(t) is continuous in t but is non-differentiable. 


8.3 Compound Poisson Process Formulation 

This involves a Poisson process N(t) and a sequence Z„ i ^ 1, of independent 
and identically distributed random variables from a distribution function G. 
The process X(t) is given by 

Nit) 

m= XZ, (56) 

1 = 1 

*<*) can be viewed as a sum of random variables and changing at discrete time 
instants which correspond to events of a Poisson process. Alternatively, we 
can view Z, as a mark attached to event i and X(f) as the cumulative sum of 
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marks associated with the events ov€r [lllocase studies^ 

this formulation as a dummy in modelling the following 

8 4 Case Study 1 [Cont.] (Component Reliability) 

Sr,.rTu, warn developed in Sec,Ion 6.7. In ,h,s secuon we bu„d a 
model for component failure due to crack propagation. 


Model 1.3 . 

length L. l ne cracK pi up b , caused bv each shock is non- 

words, the increase in crac c g of the component 

variable. X„ with a distribution function G. Let t be tne age 01 

and V(f) be the crack length by age t. Then we have 

N(i) (Zl\ 

y(r)=I^< (57) 

function F given by 

p( t ) _ | _ f(t) = prob{t > f} = prob{ y ( f ) < 

Thus, .he model use, a compound Pdtao. pro«» as a ^ ^8 £ 
failure distribution F (obtained from (58)) in Model 1.1 we 
solution to the problem. 

8 5 Case Study J [Cont.] (Speed of Aeroplanes) 

Model J.4 . J 
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exponentially distributed with mean (1/A). We assume that the increase in 
speed due to the fth breakthrough. V', is a random variable with a 
distribution G, and that the increases are statistically independent. In this case 
the maximum speed achieved at time r is given by 

mu 

X(f) = I y. (59) 

i= 1 

if N(t) is the number of breakthroughs in the interval [0, r]. 

8.6 Differential Equation Formulations 

A fairly general Markov process formulation is given by a differential equation 
formulation of the form: 

d X(t) = J\(X(t))dt + f 2 (X(t))dW(t) + UX(t))dN(t) (60) 

where W(t) is a Weiner process and N(t) is a Poisson process. The initial 
condition ^(0) can be random and needs to be specified probabilistically. The 
functions /,, f 2 and / 3 must be constrained to ensure the existence of the 
solution (in a stochastic sense). 

The differential equation is linear if the functions /, ,/ 2 and / 3 are linear in 
their arguments; if not, it is a non-linear differential equation. The above 
formulation can be viewed as the stochastic counterpart of the deterministic 
ordinary differential equations formulations studied in Chapter 5, Section 5. 

Note that in the formulation given by (60) we have X(r) changing 
continuously due to the first two terms on the right-hand side of the equation 
and undergoing jumps due to the last term. Thus X(0 is of Type C. 

Notes and References 

Our presentation has been very terse and introductory. There are many 
excellent textbooks which cover a range of stochastic formulations and their 
applications; [l]-[8] are a small sample. 

Regression formulations have been used extensively in the modelling of 
economic systems. Further details of this type of formulation can be found in 
[9]-[ll] or in any book on econometrics (e.g. [12] and [13]). 

There are a large number of books dealing with specific stochastic 
formulations and their applications. We give a small list: [14] and [15] deal 
with Markov chains; [16]—[18] deal with time-series formulations; [19]—[21 ] 
deal with point processes, with the last reference a collection of papers dealing 
with both theory and applications, and [22]-[24] deal with queues. 

A proper study of continuous time Markov processes requires a rigorous 
approach. A classic book in this mould is [25], and since then a large number 
of books have appeared on this topic (e.g. [26] and [27]). For more on 
stochastic differential equations, see [28]—[31]. 
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We have not discussed many other types of stochastic formulations for 
example, random integral equations [32]; branching processes [33]; random 
fields T341; and storage processes [35], to name a few. 

There are a large number books on the application of stochastic formul¬ 
ations to modelling problems from specific disciplines; [36]-[45] are a small 
sample. 
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CHAPTER 8 


Analysis of Mathematical 
Formulations- II 


1 Introduction 

In Chapter 6 we discussed the analysis of the underlying mathematical 
formulations used in models of deterministic systems and examined the variety 
of techniques available. The techniques of analysis for stochastic formulations 
are different from those for deterministic formulations. In this chapter we 
examine techniques of analysis for the stochastic formulations discussed in 
Chapter 7. As indicated in Chapter 7, stochastic formulations belong to the 
theory of stochastic processes. As such we start with a brief general discussion 
on the analysis of stochastic processes. Following this we discuss techniques of 
analysis for some specific stochastic formulations. 

2 Analysis of Stochastic Processes 

Let *(/)( = XU cu)) be the variable of the stochastic process which evolves with 
r according to 'the underlying mathematical formulation defining the process 
X(r). In this section we first discuss different types of analysis and later we 
discuss two different approaches to analysis. 

2.1 Probability Distribution Function 

An important problem in analysis of stochastic processes is to obtain the joint 
distribution function FfXj, I ii x 2 , f 2 ;. ••, ^ S' vcn by 

F(x 1 ,t 1 ;x 2 ,t 2 ;...;x B ,0 = probWt 1 Kx 1 ;X(t 2 Kx 2 ;...;X(t,Kx J ,} (0 

for arbitrary n. If the process has the Markov property we need to 
distribution function for n = 2; i.e. F(x „ f 2 ; x 2 , r 2 ) for arbitrary xi, * . T’ na j 

From this, the joint distribution function (1) can be obtained using co 
argument as indicated in Section 2.1 of Chapter 7. 
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2.2 Moment Analysis 

The joint distribution function completely characterizes the probabilistic 
evolution of X(t) with t. Often one is content with only moment characteriz¬ 
ation. Let Mp) represent the jth moment of X(t ), given by 

Mj(t) = E[X\t)l j= U,... (2) 

In particular, the first moment M x (t) and the variance o 2 (t) are of special 

interest. The variance o 2 (t) is related to the first two moments as follows: 

<r 2 (f) = M 2 (r)-M?(r) (3) 

Mi (0 characterizes the mean of *(0, and the variance o 2 (t) characterizes the 
dispersion in A'(r) about the mean. 

Mjit). as defined by (2), does not incorporate the relationship between X(f) 
and X(t) for i ^/. Moments which incorporate this aspect are given by 

Cj»(I,t) = E{TO (4) 

I ^ j s k < oo. When j = k = 1, C x j(f,T) is called the covariance function and it 

characterizes the statistical correlation between X(t) and A'(r). In the case of a 
stationary stochastic process, we have 

C jk (tyT) = C jk (t-T) (5) 


2.3 Steady State Analysis 


In steady state analysis one is interested in the analysis of the process as t -► oo. 
If the process is stable, then as r —► oo, ^(f) reaches steady state and the 
distribution function characterizing X(t ) becomes independent of f, i.e. 


As a consequence 


F(x, t)-> F(x) 


( 6 ) 


(7) 

where M jy 1 < oo, are constants. In general, it is much simpler to obtain Mj 

than Mj(t). 


2.4 Analysis of Associated Variable 

Often the variable of interest is not A'(f) itself, but some other variable Y (or 
YU)) which is related to A^f') We shall call such a variable an 

associated variable. Such variables are important in many contexts, as will be 
illustrated through case studies later. 

In cases where A^r) assumes only discrete states ( s x s 2 ... s n ) examples of 
associated variables are: 
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(i) the number of visits to a state s, in a specified time period starting fro 

some specified initial state; t fl _. ,. h time 

(ii) the time taken for the process to return to its starting state for thejth 

(iii) ihJ number of times the process visits two states in succession in a 
specified time period. 

In the case where X(f) assumes a continuous range of values (- oo < X(t) 

< oo), examples of associated variables are: 

<il the number of sign reversals in a specified time period; 

(ii) the fraction of the time for which X(r) exceeds a value b m a specified time 

interval; and . , 

(iii) the first time X(r) becomes zero or crosses some level b. 

Sometimes it is possible to obtain a complete probabilistic characterization of 
^associated variable. When this is not possible, one resorts to analysis 

through moments. 

2.5 Two Approaches to Analysis 

Two approaches to analysis of stochastic process formulations are analytical 

'"lnThe analytical approach the distribution function, or the moments, of X(t) 
(or an associated variable V(0) are obtained using analytica techniques. 
it involves solving either difference, or differential, equations (ordmary or 
partial) Only for a small class of formulations can these equations be solved in 
closed form. When this is not possible a computational scheme; must be 
employed A variety of techniques have been found to be useful in the 
analytical approach. In particular, Laplace transforms and moment-generat¬ 
ing functions have proved to be extremely useful for a wide range of 

^When'iTis difficult to obtain exact analytical solutions one can resortto 
obtaining analytical expressions for lower and ^ 

of interest. For a variety of formulations such bounds can be obtained wit 
considerably less effort than that required for the exact solution. 

Jn the simulation approach the stochastic process is simutoted.on a 
computer to generate individual time histories (also called^^“Jndent 
nrocess. By repeating the simulation a large number of times, 
statistical information about the process is obtained to yiel estum es ^ ^ 
the distribution functions (or the moments) of the variables of inte 
this approach can be viewed as computer experimentation. . Tta 

In this chapter we will confine our attention to the ana y tea pp i 

simulation approach will be discussed in Chapter 9. 
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3 Static Formulations 

In Section 4 of Chapter 7 we discussed two types of static formulations - 
analysis of variance and regression. The analysis of variance formulations will 
be discussed in Chapter 11, and in this section we discuss briefly the analysis of 
regression formulation. 


* 3.1 Regression Formulations 

The structure of such formulations is given by the relationship 

Yi = g(X u . X ni ;9)+V i ( 8 ) 

where X#(\ 1) are known variables and MO ^ 1) are unknown 

random variables with zero mean. An important problem in analysis of such 
formulations is to obtain the probabilistic characterization of T, + / (/ > 0) given 
Y k ( 1 ^k^i) and X jk (\ ^n;\ ^k^i + /). 

If MO ^ 1) is a sequence of uncorrelated random variables, then 

prob {1 ^k^i;X jk , 1 1 <*«/+/} 

= prob [ T,‘+/ + ju+D' 1 (9) 

As a result, we have: 

F{Ynr t } = g(X w + n . X n{i ^9) ( 10 ) 


and 

var (T,+/} = var {F l+/ } (11) 

However, if V£i ^ 1) is a correlated sequence, then (9) is no longer true. The 
characterization of V, +1 will depend on the nature of the correlation between V { 
and Vj(i ^ j). We shall discuss this aspect later when dealing with the analysis 
of discrete time/continuous state formulations. 


3.2 Case Study J [Cont.] ( Speed of Aeroplanes ) 

In Section 4.4 of Chapter 7 we formulated three models (Models J.l to J.3) 
involving regression formulations for predicting the maximum speed of 
aeroplanes. In this section we consider the analysis of Model J.3. 


Model J.3 

The maximum speed achieved by time t is a random variable X{t) given by 
equations (18)—(20) of Chapter 7. Since the uncertainty in X{t) is solely due to 
M0, an uncorrelated random sequence with zero mean and variance a 2 , we 
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have the £[*(0] and the var[X(r)] given by 

c , + </,(!- 1900), 1900^r<1940 

£[X(f)] =c 2 + d 2 (t - 1940), 1940 s$ t s* 1980 (12) 

t-3 + diU- 198°), 1980 < t ^ 2020 

and var[X(t)] = ff 2 . 1900 < t < 2020 (13) 

In other words, the mean is given by a function which is piecewise linear and 
the variance is constant. 


4 Discrete State/Discrete Time Formulations 

In Section 5 of Chapter 7 we discussed discrete state/discrete time formul¬ 
ations where the discrete states assumed by X , are denoted by s,, s 2 ,..., s r and 
the time variable! assuming integer values. In particular, if X, has the Markov 
property, then the formulation is specified by an r x r one-step transition 

matrix P(f): 


where 


P(t) = 


PnU) 

£„(') 


£;*(') = prob{A', + 1 


• />„(')] 

• PrrU) J 

= s k \X, = Sj } 


(14) 


for 1 k < r. The elements of the matrix are subject to the constraints given 

by equation (24) of Chapter 7. .... . . 

The initial state X 0 is specified through an r-dimensional probability vector 

p(0) with 

p ( (0) = prob{Xo = s i }, 1 


and the elements of p(0) subject to the constraint given by equation (27) of 
Chapter 7. In this section we discuss some aspects of the analysis of such 
formulations. We confine our attention to the case where r is finite. 


4.1 Evolution of X, 

Let p(t) be an r-dimensional vector with 

P T {t)= Ol(0.--Pr(0] 

which characterizes the evolution of X„ i.e.: 


p,{t) = prob {X, = s,}, 1 ^ i < r 

Then, we have 

p T(t + \) = P(t)p T (t) 


(17) 

(18) 


for f>0. Using the initial probability vector p(0) as the initial 

the above equation can be solved recursively to obtain p(f) for t J* 0. Note in 
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p(t + 1) can also be rewritten as 


P T (t+l) = P[t,t')p T (t') 

(19) 

for 0 < t' < r, with P{t,t') given by the relation 


P(t,t')= ft P(j). 

i-t' 

(20) 

P(t. O can be viewed as a (t + 1 — f')-step transition matrix which characterizes 
the transition from time t' to time (t + 1). When the chain is stationary, i.e. 
P(t) = P for all f, then 

p(t,n=p (, - n 

(21) 


4.2 Steady State Probabilities 

The steady state behaviour of X„ i.e. the behaviour of X, as t -> oo, for the case 
where the chain is stationary, depends on the structure of the one-step 
transition matrix P. 

In this context the decomposition of the states into transient and recurrent 
plays an important role. A state /' is said to be transient (recurrent) if the 
probability of the process not eventually returning to the state is non-zero 
(zero). The recurrent states can be grouped into different equivalence classes 
with the states belonging to an equivalence class being accessible to each other. 
If there be no transient states and the recurrent states belong to one single 
equivalence class, then the chain is said to be irreducible; if not it is called a 
reducible chain. 

For an irreducible stationary Markov chain, as f->oo, p(r)->p*, a steady 
state vector which satisfies the following relationships 

p* = p T p * (22) 

with the elements of p* summing to one, i.e. 

t Pt = 1 (23) 

1 = 1 

The steady state analysis for a reducible chain is more involved. 

4.3 Analysis of an Associated Variable 

We shall confine our discussion to a study of an associated variable called 
occupation time". This variable refers to the number of times the process X t 
occupies a certain state in a given time period. 

Lct N }" j> be the nurT >ber of times X, visits state s, in a time interval n, given 
1 at initially it was in state s ( . If the chain is stationary and irreducible, then we 
nave the following asymptotic result - i.e.: 

lim Af'jVn -* pf 


(24) 
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where pj is the jth component of p* given by (22) and (23). Note that the initial 
starting state has no effect on this probability. 

4.4 Case Study G [Cont.J ( Weather Changes) 

In Section 5.3 of Chapter 7 we formulated three different models (Models G.l 
to G.3) for modelling weather changes. In this section we carry out an analysis 
of Model G.l. 


Model G.l 

Since there are only three states (states 1, 2 and 3 corresponding to sunny, 
partly sunny and cloudy weather), the one-step transition matrix is given by a 

3x3 matrix: 



~ p xx 

P 12 

1 -Pll 

- P 

p = 

P 2 > 

P ll 

1-P 2 , 

- P 


p sx 

P S2 

\~Pyx 

-P 


12 


(25) 


with p (1 ^ i ^ 3 1 ^j ^ 2) being non-negative variables constrained such that 
the last entry in each row is also non-negative. Note that, in the way we have 
written the matrix, each row sums to unity. 

Since the weather can change arbitrarily from one state (i.e. sunny, partly 
sunny or cloudy) to any other, the assumption that the chain is irreducible is 
justified. Our interest is to obtain the expected number of sunny and partly 
sunny days in a year. This corresponds to obtaining the expected number of 
visits to states 1 and 2 respectively over a year. As there are 365 days in a year, 
and since this is a large number, we can use the asymptotic results of Section 
4.3 to yield: 

£ {sunny days in a year} ~ 365 p? (26) 

£ {partly sunny days in a yearl} ~ 365 p* 

pf and p$ arc given by: 

(l-P t2 )pf+0-f > ii)P2=(l- p n- p i2) (27) 

(1 - P 22 )pT + 0 -P2,)Pl=U- P 21- p 22)- 

(Note (27) is obtained using (25) in (22).) 

5 Queueing Process Formulations 

In the queueing process formulation the variable X(t) represents the number of 
customers, either being served or waiting to be served, at time t. In Section • 
of Chapter 7 we discussed the underlying structure of the queueing process 
and defined four different types of queues - M/M/S, M/G/S , * 

C/G/S, respectively. In this section we discuss the analysis of some o 

processes. 
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5.1 Evolution of X(t) 

In the case of the M/M/S queue, A"(f) has the Markov property. As such it is not 
too difficult to obtain the probabilistic characterization of A"(t) as it evolves 
over t. As will be shown later, this probabilistic characterization can be 
obtained as the solution of a set of coupled linear ordinary differential 
equations. The steady state probabilities, if they exist, can also be obtained 
fairly easily. 

In the case of M/G/S or G/M/S or G/G/S queues, A"(f) no longer has the 
Markov property; as a result the analysis of such formulation is more difficult. 
We shall briefly discuss the analysis of the G/M/S queue as an example of this 
class of problems. 

For the G/M/S queue one can use essentially two different approaches. In 
both the analysis is reduced to a study of a formulation having the Markov 
property. In the first approach we define a variable T(f) which represents the 
time that has elapsed since the last arrival. Y(t) is a sawtooth function 
increasing linearly with t and becoming zero with the arrival of each customer. 
A vector Z(f), defined thus: 

Z(t) = {X(t),Y(t)} T (28) 

can be shown to have the Markov property, and hence is easier to analyse than 
X(f). In the second approach the queue is studied in terms of A"(f) only at time 
instants at which a customer arrives. Let X„ represent A"(f) at the time instant 
of the arrival of the nth customer. Then it can be shown that the sequence X„, 
I. has the Markov property. More specifically, X n can be viewed as a 
Markov chain with the states being the set of non-negative integers. 


5.2 Case Study K [Cont.] (Supermarket Operation) 

In Section 6.5 of Chapter 7 we discussed two models (Models K.l and K.2) 
involving queueing process formulation for modelling the operation of a 
supermarket checkout counter. Model K.l involved a M/M/S queue formul¬ 
ation, whilst Model K.2 involved a G/G/S queue formulation. In this section 
we carry out an analysis of Model K.l. 


Model K.l 

The characterization of customer arrival and service and the queue discipline 
are given by assumptions (i)-(iv) of Model K.l in Section 6.5 of Chapter 7. 
et A(r) represent the number of customers either in the queue or being 
at time t. N{t) is a random variable which can assume only non-negative 
eger values. N(t) = 0 corresponds to all the servers being idle and N(t) ^ s 
•responds to all the servers being busy at time t. Let p„(f) be the probability 
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that N(t) = n,n = 0,1,2...; then we have 


i/Wl-l (29, 

n = 0 

Let N(t) = n Then at time t + St either there is no change in N(t) or it can either 
increase o^ decrease (provided N(f) # 0). Because of the inter-amva. and 
service times being exponential we have, conditional on N(t) - n. 

n + 1 with probability /.St + o(St) 

N(t + St) = n with probability (1 - A<$l — p n St) + o(St) (30) 

n _ i with probability p n St + <>(<5l) 

HU) changing by two or more in St is <>(^0 and can be ignored as St -»0. 
Since there are s servers operating in parallel the variable p„ is given by. 

Pn = sp for n > s (31) 

= np for n s* s 

As a result we have for n > 0: 

p„(t + St) = prob {N(t + d) = «} = P„-i(0{A5t) 

+ P„(f){ 1 - 'St — Pn^t} + Pn+ l(0{P<l + l I + ^2) 

We have used the conditional argument in obtaining (32) where the first term 
corresponds to N(t) = n- 1 with one arrival and de P arture -^ e se ^ 
term to N(t) = n with no arrival or departure and the third term to N(t) = n + 1 
with one departure and no arrival. Taking the limit, we have: 

p’(f) = dp„(f)df = - (/ + P„)P„U) + 'Pn- |M + Pn+lP** At) 

Similarly, we have for JV(f) = 0: 

p'o(f) = dp o (0/df = - Apo(f) + PPi(0- 

We assume that f = 0 corresponds to the instant the supermarket opens in 
the morning. Since there are no customers before the supermarket opens, we 

have: 

p„(0) = 0forn*0 (J5) 

and 

Po(0)=l 

These define the initial conditions for (33) and (34). 

Steady State Probabilities 

Under certain conditions, for large r, p„(f). O 0, approach steady state vai^| 
p * which are independent of t. More specifically: 

„oi_»pJ as f-KXi 


(36) 
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These can be obtained from (33) and (34) by setting the derivatives to zero and 
solving the resulting system of linear algebraic equations. It can be shown 
(after some algebraic manipulation) that 


and 


(,r = o r! s\ j 

* _ f (sp)”Po/ n! for « < s 
P " j fp’p^/sl for n^s 


(37) 

(38) 


where p = X/sp. Note that the condition for the existence of this limiting 
distribution is given by 

p = X/sp < I (39) 


Expected Queue Length 

Under steady state conditions (i.e. the process has been operating for a long 
time and that (39) is satisfied) let QL represent the number of people waiting in 
the line and L the expected value of it. Then, we have 

L=E[QL]= £ (n — s)p* (40) 

n = s + I 

for there is a queue only when the number of customers exceeds s. Using /?* 
from (37) and (38) we have 

L = ppf/(\-p) 2 (41) 

indicating that the average queue length increases to infinity as /)-► 1. 


Waiting Time For A Customer 


Let the waiting time for a customer under steady state conditions be denoted 
by W and let 

F(x) = prob| W <x} (42) 

Since there is no waiting if the number of customers is less than or equal to s, we 
have 

F(0) = probjyV(r) < s} 

= P* l' M7n! (43) 

11 = 0 

When the number of customers N(t) = n and n is greater than s, then there are 
bi-.v) waiting to be served, and in this case we have 


prob{.v < W < x + Sx\N(t) = n} 


c' SfiX (spx) n ~ 5 


n — s 


(sp)dx 


(44) 
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This follows as W is the sum of the service times for (n - s) customers with s 
Lrve's Sinil service .imes are exponent .he server has a gamma dens,., 
function as indicated above. Since: 

prob{N(f) = n} = p* 


we have 


dF(x) = £ p? 


‘(spx)" 


(sp)dx 


(45) 

(46) 


(46') 


(n - s)! 

which on simplification gets reduced to 

dF(x) = spp* e ' ,w ‘ “dx 

From (43) and (46') we have the expected waiting time for a customer given by 

2 , (47) 

E[W ] = xdF(x) = p*![s\x(\ - p) ] 

Jo 

again indicating that the expected waiting time increases to infinity as p-1. 


6 Renewal Process Formulations 

As indicated in Section 6.6 of Chapter 7, the renewal process is a special type of 

renewal point then the analysis is more comp 1 rn ft and let X 

1et N(D reoresent the number of renewals in the interval [0,0 and let X, 

*. **-»-*-«*>«■ ■ w * ,reai ,he 
renewal at the origin I = 0 as the zero-th renewal. 


6.1 Analysis of N(t) 

The crucial step in the analysis is the following statement: 

N(t) n if and only if S„ < f 

where S„ is defined as follows. 

i = 1 

Since S is the sum of n independent and identically distributed ra 
varShtes/the dis.ribu.ion fu„c,io„ of S. is .he e-fold con.oluuon of <M 

with itself. 


(48) 


(49) 
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From (48) we have 

p„(r)= = prob{ N{t) = n} 

= prob! N(t) > — prob{ /V(r) > n + I) 

(50) 

= prob{S„ ^ /} — prob{S n+ , 

= W„(0-W B+1 U) 

6.2 Expected Number of Renewals 

Let MU) represent the expected number of renewals in [0.0, i.e.: 

M(f) = £[N(0] (51) 

One way of obtaining M{t) is by using the relationship 


M(t)= £ np„U) 


(52) 


An alternative approach is via the conditional argument based on the 
observation that the renewal process repeats itself after each renewal. Let the 
first renewal occur at X, = x. Then conditioned on this, we have 


M{t\X t =x) = 0 

= M(t - x) + I 


if x>t) 
if x < t J 


By removing the conditioning, i.e. 


MU) 


-r. 


M(t IX, = x)dG(x) 


we have the result 


M(t) = G(t) + 




x)dG(x) 


(53) 


(54) 


(55) 


This is a linear integral equation which can be solved analytically for simple 
forms for G(x). In general, one would need a computational technique to 
obtain M(r) from (55). 

In the limit t-> x, we have the following asymptotic result. 


lim M(t)/t \/p 


where 


P = £[*] 


■f 


xdC(x) 


(56) 

(57) 


b.3 Case Study / [Cont.] (Component Reliability) 

In Section 6.7 of Chapter 7 we discussed modelling failure and replacement of 
an unreliable control element and formulated two models (Models LI and 1.2). 
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Model LI involved a renewal process formula,,on and Model 1.2 involved an 
alternating renewal process formulation. In this section we V 
analysis of Model 1.1. 


Model 1.1 

We shall assume that the failure distribution G(x) is given by 

G(x) = 0(l -e-'’*) + (l- 0)0-e - "*) 


(58) 


i e. it is a mixture of two exponential distributions. The density function g(x) 
(= dG(x)/dx) is given by 


g (x) = 0 Pi c-' , ' x + (\ -0)P2^ I ’ 2X 


(59) 

To obtain the expected number of replacements over the int « v ^0 we 
need to evaluate M(r) given by (55). Denoting the Laplace f 
G(t ) and g(t) by M(s), G(s) and g(s) respectively, we have from (55) 


M(s) = G(s) + M(s)$s) 

(1 -0)p 2 


-h 

S Pi + s 


} 


From (58) and (59) we have 

f Op i 

and - a, v, 

G(s) = g(s)/s 

Substituting (61) and (62) in (60) and simplifying yields 


M(s) 


1 \ a 2 — p 2 0(1 -0)(Pi-P 2 ) 

= n-r-T- zm r”v 




s 2 p 


s?(s-s,) 


where 


S[ = — {(1 - 0)Pi + 0p 2 j 


(60) 

(61) 

(62) 

(63) 

(64) 


and „ and o’ are the mean and variance of X. On taking the inverse Laplace 
transform, we have 

(65) 


r a 2 -p^_ 0(1 — G)(p, -Pi) 2 e _ 3 




Note, that as t - oo we obtain the relationship given by (56). 


7 Continuous State/Discrete Time Formulations 

In Section 7 of Chapter 7 we discussed a fairly general our 

state/discrete time formulation. However in this section we 
attention to the analysis of a special formulation discussed in Section 

Chapter 7. 
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The variable X, of the formulation is given by the relation 

X,= Y,+ V, ' ( 66 ) 

with Y, a known deterministic sequence and V, an unknown stochastic 
sequence generated by a linear difference equation of the form 

K = i XjK-j+ i PjW,_j (67) 

j =I j=0 

with W { being a sequence of independent and identically distributed 
Gaussian random variables with zero mean and variance a 1 . 

Since the Gaussian property is preserved under linear operation, K is a 
correlated Gaussian sequence, the probabilistic characterization of V, can be 
given through its first two moments. We shall confine our attention to V, being 
a stationary process with the first moment (viz. £[K,]) = 0 and the second 
moment (viz. £[K f K f +J) given by 

vO) = £[K,K f+ ,]. (68) 

As a result, X, is also a correlated Gaussian sequence, with 

£[*,]= y, 

and (69) 

£[(*,- W] = v(y) 

Expressions for v(j) can be obtained in terms of the parameters of the 
difference equations given by (67). The v(j) values satisfy a set of linear 
algebraic equations known as Yule-Walker equations. If m = 0(i.e. V, is a MA 
process) then v(j) is identically zero for \j\^m. If n #0 (i.e. V, is either an 
autoregressive process if m = 0 or an autoregressive moving average process if 
m 7*0) v(j) is non-zero for all j and when j becomes large it is a decreasing 
sequence in j. 

The behaviour of v(j) as a function of j plays an important role in the 
selection of n and m of (67) in time series modelling. We shall discuss this aspect 
in Chapter 16 where we study in detail a case study involving time series 
modelling. 

8 Continuous State/Continuous Time Formulations 

The continuous state/continuous time formulations were briefly discussed in 
Section 8 of Chapter 7. In this section we confine our attention to analysis of 
t*o specific types of formulations, both of which have Markov property. The 
•rst formulation involves a Weiner process and the results of the analysis will 
used in the next chapter in the context of Monte Carlo method for solving 
parabolic partial differential equations. The second formulation is of relevance 
,n the context of Case Studies I and J. 
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8.1 Differential Equation Formulations with 
Weiner Process 

A general formulation is given by X(r) satisfying the following differential 
equation 

dX(t) =/,(X(f), t)df + / 2 (X(f). t)d W (70) 

where W{t) is the standard Weiner process. The initial condition X(0) is 
uncertain and characterized through a density function p(x,f 0 )v , e - 

prob {x < X(0) x + <5x} = P(x, 1 0 )<5x + o(dx) (71) 

and we assume that XU) is unconstrained. 

Let p(x,f) be the density function characterizing X(f), i.e. 

prob {x < X(f) i£x + <5x) = p(x , f)<5x + otfx) (72) 

Then it can be shown (we omit the details) that p(x.r) satisfies the following 
partial differential equation: 

dp{x : t l = i{i[/Kx. t) p(x. i)] - ~ [/, (x. r) p(x. r)]} (73) 

dt 2\<V U2 3x ) 


with 

and 


I.C.: p(x,f 0 ) as specified 
B.C.: p(x,f)-»0 as |x| 


00 


(74) 


Note that the B.C. will change if the process is constrained; e.g. X(f) must 

always lie in the interval [a, />]• . „ _ . . , „„ 

In general, it is not possible to solve (73) analytically. For the special case 


/ 2 (x.f) = ff 2 ; /,(x,f)= -fix and p(x,t 0 ) = d(x) 

an analytic solution exists and is given by 

1 


p(x, t) = 


where 


T^du) 


>-x 2 !2D{t) 


£)(r) = er 2 ( 1 -e~ 2p ')/2P 


(75) 


(76) 


(77) 


As t - co. D{t)-*o 2 /2p and p(x.f)-p*(x), the steady state probability density 

function for X(t). . ,,nder 

Note that we could have obtained (76) more directly by noting that unde 

(75) the differential equation is linear, as a consequence of vthuAXW 
Gaussian. Thus X(f) is completely characterized through its fir 
moments. The first moment is zero and the second moment M 2 (0 S • j 
the equation 


d M 2 (t)/dt = — 2/iM 2 (f) + <r 2 
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with 

M 2 (0) = 0 (78) 

This yields M 2 (f) = £>(/) given by (77). 


8.2 Compound Poisson Process Formulations 


The compound Poisson process formulation was discussed briefly in Sec¬ 
tion 8.3 of Chapter 7. The variable X(r) is given by 

mn 

*U)= I Z, (79) 

I = 1 

where Z, is a sequence of independent and identically distributed random 
variables with a density function of y(z), and N(t) is a stationary Poisson 
process with parameter /. and independent of Z ( . 

Let p(x. t) denote the density function of X(r), and let p(x, f|n) be the density 
function of X(r) conditioned on N(t) = n. 

Conditioned on N(t) = n. X(r) is a sum of n independent random variables, 
as a result of which we have p(x, f|n) given by a w-fold convolution, i.e. 


p(x,t\n) = g{x)*g(x)*... *g(x) (80) 

where * is the convolution operator. On removing the conditioning, we have 

X 

P(x,t)= £ p(x,r|n)prob{/VU) = n} 


= f 

n = 0 M! 

Using characteristic function <b(O y t) defined as 


- Al 


we have it equal 


where 


<D (0 y t) = E{c i0m } 


= I V{(0)}' 

n = 0 


,| (/t)"e- ;, j 


T(0) = £{e'«--}= p e ie: g(z)dz 

J - 00 

and from this we have the following result. 


dp(xj) 

dt 


= - /.p(x, f) + / 


j p(y, t)g(x - 
J - ao 


y)dy 


(81) 

(82) 


(83) 


(84) 


(85) 


Thus the probabilistic characterization of X(f), i.e. p(x, r), can be obtained by 
fixing (85). 
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The First Passage Time 


Let t represent the time for X(f) to reach or cross a barrier at x = L. We shall 
confine our attention to the case where the V; values are non-negative. In this it 
is impossible for the process X(f) to recross the barrier and hence 



r >t if and only if X(t ) < L 

(86) 

Let q(v, L) denote the density function of r. Then from (86) we have 



j q(v, L) dx = J 

j p(x,f)dx 

(87) 

or 

f f 

C L 



1 - q(v,L) dx = 

p(x, t)dx 

(88) 


o Jo 


Using transform techniques it is possible to obtain expressions for q(v, L); we 
illustrate this in a later subsection. 

8.3 Case Study J [Cont.] (Speed of Aeroplanes) 

In Section 8.5 of Chapter 7 we formulated two models (Models J.4 and J.5) for 
predicting X(f), the maximum speed of aeroplanes. Both these models use 
compound Poisson process formulations as dummies. In this section we carry 
out an analysis of Model J.4. 


Model J.4 


We assume that the density function characterizing the changes to the 
maximum speed with each breakthrough is exponential, i.e. 

y(z) = (89) 

As a result the density function for X(r) conditioned on N(t) = n is given by (80). 
On taking the Laplace transform with respect to .x we have 

0(s,f|n) = £{p(x,t|n)} 

= [</(*)]” = {p/(p + s)}" 


Using this in (82) gives 

(90) 

A + s/ 

On taking the inverse Laplace transform, we have 

p(x, t) = e~ A, 6(x) + ^JipXt/x) e ■ <■*' * **• /,(2 v /pxAfj (91) 

where <5(x) is the delta function and /,(.) is a Bessell function of imaginary 
argument. 


p(s,f) = £{p<x,f)} = i (-M 


(Af) e' 


= e 


'exp 
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From this, expressions for the various moments of AV) can be obtained. In 
particular, the mean M(t) and the variance cr 2 (t) are as follows: 


and 


<r 2 (r) = Af(l + p)/p 


(92) 


From (92) we see that both the mean and the variance of X(r) increases 
linearly with t. Note that we could have derived the expression for M(t) more 
easily by noting that on the average one breakthrough occurs every 1/A units 
of time, and the average increase per breakthrough (= E(Z )) is 1/p. As a result, 
at time t the mean value of X(t) is given by the expression in (92). 


gj Case Study / [Cont.] (Component Reliability) 

In Section 8.4 of Chapter 7 we formulated a model (Model 1.3) for characteriz¬ 
ing the time to failure. The variable of the model, X(f), represents the crack 
length. X(t) changes as described in Section 8.3 and failure occurs when it 
crosses a level L. In other words, the time to failure is given by t, and we seek 
the density function q(i\L) for it. 

We shall assume that the increase in crack length with each shock is given by 
an exponential density function, i.e. y(z) is given by (89). 

The density function q(v, L) can be obtained from (88). On taking the 
Laplace transform with respect to f, we have 


where 


{1-4(a.L)}/a = J pix, 
• L, = |< 


a)dx 


4(a 


q(t,L)e~ a ‘dt 


p(x,a) = J 


p(x,r)e °'df 


Rearranging (93) yields 


4(a,L)=l-aJ t /5(x,a)dx 

Taking Laplace transform with respect to L we have 

q(a,s) = {1 — p(s,a)}/s 


where 


r,s)= j 4(a, 


L)e~ sL dL 


a)e _M dx 


(93) 


(94) 


(95) 


(96) 


(97) 
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*5. a) can be obtained from (90) by taking the Laplace transform with respect 
to f. On substituting this into (96) we have 

q(a. s) = A/{pA + an + as} ^ 

On taking the inverse transforms, we have 

where /<>(.) is a Bessel function of imaginary argument. This gives the 
probability density function for the time to failure. _ 

From (99) we can obtain the various moments of r. In particular, the first 

moment (or the mean time to failure) is given by 

£[ t] = j* vq(v. L)dt> = (1 + L)/n- ^00) 

Similarly, other moments can be obtained. 

Notes and References 

Our presentation has been very terse. For further details of analysis of different 
types of stochastic processes see the books mentioned in Chap er . 

The derivation of the results of Section 4 can be found in most books on 
stochastic processes (e.g. [1]) or any book on Markov chains. More advanced 
books on Markov chains deal with specialized topics - e g. [2] deals with the 
analysis when the states are aggregated, [3] with the analysis of various leve - 
crossing problems. 

Item [4] contains details of derivation for a variety of queueing processes. 
For further details of the derivation of the results of Section 6 see [5]| and [6]. 

Details of analysis of stochastic differential equations (Section 8.1) can be 
found in [7], which also examines the case where X(f) is constrained to lie in a 
specified interval. For level-crossing problems associated with continuous 
state/continuous time processes, see [8], Item [5] gives details o 
derivations of Sections 8.2-8.4 for the compound Poisson process. 

Stationary stochastic processes can be analysed either in time domain or in 
frequency domain. The latter method of analysis is called the “spectral method 
of analysis”; further details of it can be found in [9] and [10]. 
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CHAPTER 9 


Simulation 


1 Introduction 

In Chapter 5 we discussed a variety of deterministic formulations and in 
Chapter 7 a variety of stochastic formulations suited to modelling systems. To 
obtain model behaviour we examined the analysis of these two classes of 
formulations in Chapters 6 and 8 respectively. Unfortunately only a small 
class of formulations can be solved by analytical methods. For the majority of 
formulations the solution can be obtained only by means of some compu - 
ational method. In Chapter 4 (Section 5.2) we mentioned that the implement¬ 
ation of a mathematical model on a computer to obtain the belhav.ou |ms called 
“simulation", and the computer with the instructions to implement the model 

a "in'chapter 6 we indicaled .he dilteren. types of con.pu.ers useful in solving 
mathematical formulations. We went on to discuss some specific techniques 
for solving deterministic formulations on digital computers without going in 
the implementation details. In a sense those solution procedures can be 
regarded as a part of digital computer simulation of systems descnbable by 

deterministic formulations. . . , „„ 

In this chapter we focus our attention on the implementation of a model on 
a computer to obtain the model behaviour. Both analog computer simulations 
and digital computer simulations will be discussed. . 

The simulation procedure depends on the nature of t ie “ n e ^ 
mathematical formulation. Simulation techniques can be d > v ' ^ n o ™ 
ent categories depending on whether changes in the model occur continuo y 
overtme or onfy at dLrete points in time, and whether the changes r 
deterministic or stochastic. The translation of a modelformore correcly 
underlying mathematical formulation) into a program for implementat °" ° w 
a digital computer can be done using a variety of languages. We shal 
some of the more widely used languages and bnefly discuss twcof 
Finally, we conclude with a brief discussion of Monte Carlo method . of 
methods yield approximate estimates of the solution or a 
deterministic formulations by computer experimentation. 
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2 Analog Computer Simulation 

The analog computer is ideally suited to simulating deterministic models 
involving ordinary differential equation formulations. They can be used, less 
effectively, for simulating models involving other types of deterministic 
formulations and stochastic formulations. However, we shall confine our 
attention to simulating models involving ordinary differential equation 
formulations. 


2.1 Principles of Analog Simulation 

As mentioned in Chapter 6 (Section 3.2), an analog computer consists of a 
variety of components (or blocks) - each characterized by one or more input 
voltages and an output voltage. The output and input voltages can change 
with time and are related by some mathematical relationships (e.g. addition, 
multiplication, division, integration). By linking different components (also 
called patching) the voltages become related and satisfy some algebraic and/or 
differential equation. 

If the underlying formulation of a model is an ordinary differential equation 
then, by suitable patching, the differential equation satisfied by the voltages in 
the analog computer simulation can be made identical to a scaled version of 
the given differential equation. Thus the time history of one or more voltages 
corresponds to the solution of the differential equation and hence constitutes 
the behaviour of the model. 


2.2 Procedure for Simulation 

The procedure for simulating a model describable in terms of an ordinary 

differential equation formulation is given by the following four steps. 

Step 1: Scaling the equation. This involves rewriting the given differential 
equation in terms of non-dimensional variables, as discussed in Section 4.3 
of Chapter 6. Note that this involves scaling both the independent variable 
and all the dependent variables and their derivatives. 

Step 2: Drawing the patching diagram. This is a flow diagram which shows the 
connection between different components of the analog computer so that 
the equation relating the voltage is identical to the scaled equation of step 1. 
The diagram is drawn using special symbols for the different components; a 
short list of symbols is given in Figure 9.1. 

Step 3. Patching and checking. Patching involves connecting different compo¬ 
nents using wires (also called patching cords) according to the patching 
diagram. Checking is a procedure to test if the patching is in agreement with 
the patching diagram. 
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Component symbol 



Operational relationship 

e„(t)* k e,(t) 

(OSkSI) 




e 0 (t) * — }e 0 Co) 


-«i(t)—I 
-e 2 (t)—1 



e„(t)- 


e 0 (t) • — e,(t) * e 2 (t) 


-«HTV c„(t) r - «.(t) * -e,(t) /e 2 (t) 

Figure 9.1 A sample of component symbols used in patching for analog simulation. 


Step 4: Running the computer. This corresponds to initiating the sequenceof 
operations to generate the solution and hence the behaviour of the model. 

2.3 Case Study E [Com.] ( World Population) 

In Section 5.5 of Chapter 5 we proposed three models (Models E4 to 
involving ordinary differential equation formulations. In this section w 
examine simulating Model E.5 (i). 


Model E.5 (i) 


The model is given by 

N'{t) = dN(f)/df = 9{L — N(t)}N(t) 

where N(t) represents the total population at time f and the initial condition 


given by 
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N(t 0 ) = N 0 (2) 

Step 1: Scaling. Let N m and N’ m represent the maximum values of N(t) and 
N'(t) respectively. Define n(t) and ri(t) as follows: 

n(t) = N(t)/N m and n'(t) = N'(t)/N’ m (3) 

These are the scaled variables and using (3) in (1) we have 

«'(») = {NJN’ m }0N m {L/N m - n(t)}n(t) (4) 

Note that 

j*n'(f)df jtn(t) (5) 

but 

= {NJN' m }n( t) (6) 

Since L is the upper limit for N(t), we select N m = L. We also consider a 
transformation of the independent variable t to r such that 

t = Pr (7) 

This results in (4) getting transformed into 

dn(i)/dr = <5{ 1 — m(t) }m(t) (8) 

where 

S = POL (9) 

We select P to ensure S < 1. Finally, the initial condition is given by 

n(t 0 ) = n 0 = N 0 /L (10) 

Step 2: Patching diagram. The patching diagram is as shown in Figure 9.2. 
We omit the details of steps 3 and 4. The simulation results for n(r) are as 
shown in Figure 9.3 for two values of 6. From this we can obtain N(t) using (3) 
and (7) for two values of 0 , as 0 and <5 are related by (9). 


3 Digital Computer Simulation 

We shall first consider simulating models involving deterministic formul¬ 
ations. and later the simulation of models involving stochastic formulations. 


3./ Deterministic Model Simulation 

If the model to be simulated involves difference equation formulations (i.e. 
changes to the variables of the model occur at discrete points in time) then 
simulating the model on a digital computer is fairly easy. The difference 
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Figure 9.3 Plot of n(t) for two values of 6 for Model E.5. 


equation can be viewed as a recursive relationship, and the solution (and hence 
the model behaviour) obtained by a simple recursive scheme w tc i ca " 
easily implemented. When the model involves either ordinary d '"^ e " 
equation or partial differential equation formulations, then the e 
cannot be obtained as a continuous function of time (time or spatia var 
in the case of partial differential equation) because of the nature of j h *JJ * 
computer. However, as discussed in Section 3.1 of Chapter 6, the so u cal l 
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be obtained for discrete values of variables by treating the continuous (low of 
the variable by an approximation where the variable is incremented in small 
steps. The study of algorithms to obtain the solution for discrete values of time 
(time and spatial variables) in the case of ordinary (partial differential) 
equations belongs to the theory of numerical analysis. We shall not discuss 
this aspect here, but instead confine our attention to computer 
implementation. 

The computer implementation of a model can be done using either a 
general-purpose language (e.g. Fortran, Pascal) or a special simulation 
language (e.g. CSMP, Dynamo) designed for specific model types. The 
implementation using a general-purpose language requires the user to supply 
the following programs: 

(i) program to define model formulation, 

(ii) program to generate solution, 

(iii) program to display solution and 

(iv) main program to execute (i)-(iii). 

As a result it requires a lot of programming from the user. The programming 
effort can be substantially reduced if a specially designed language is used, in 
which case the user has to write a program to define the model formulation and 
the special language takes care of the rest. 

3.2 Stochastic Model Simulation 

We shall confine our discussion to simulating stochastic models where 
changes occur at discrete points in time - as in the case of discrete time models 
(with continuous or discrete states) or continuous time models with discrete 
states. The simulation of stochastic models differs from deterministic models in 
one important sense. In stochastic model simulation each simulation yields a 
different time history of model behaviour. This is to be expected, as each time 
history corresponds to a particular outcome (or realization) of the underlying 
stochastic process formulation. 

The simulation of a stochastic model requires generation of random 
numbers, updating the clock (which represents the time variable of the model) 
and other variables using the random numbers generated earlier; plotting or 
displaying the time history of model behaviour and finally compiling the 
statistics of the model behaviour based on a specified number of independent 
simulations. Since it is not possible to generate truly random numbers, 
simulation of stochastic models is done by using pseudo-random numbers 
generated by algorithms. A variety of such random-number generators have 

cn developed for generating random numbers with specific statistical 
Properties. 

A s in the case of deterministic model simulation, the computer 
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implementation of a stochastic model can be done us ^| e ' ther a general- 
purpose language or a special simulation language (e.g. * . . 

designed for specific model types. We shall review some of these spec.a 
languages in a later section. We illustrate the principles of stochastic model 
simulation by considering the following case study. 

3.3 Case Study K [Cont.] (Supermarket Operation) 

In Section 6.5 of Chapter 7 we proposed two models (Models K.l and K.2) for 
the checkout counter operation. The behaviour of Model K.l was obtained by 
analytical methods in Section 5.2 of Chapter 8. It is extremely difficult to 
obtain the behaviour of Model K.2 by analytical methods. In this section we 
discuss the use of simulation to obtain the model behaviour. 


Model K.2 

We assume that there are S counters. Customers arrive according to a point 
process with inter-arrival times being a sequence of independent random 
variables with a distribution function F. The service times are also independ¬ 
ent random variables with a distribution function G. When a customer arrives, 
the service starts immediately if one of the counters is free. If all the counters 
are busy the customer joins the queue at the counter which has the least 
number of people waiting to be served. (If there be more than one such counter, 
the customer chooses among them randomly. The same is true in the selection 
of a counter if more than one counter is free at the time the customer arnv«.) 

The passage of customer through the checkout counter can be described by 
a flow diagram shown in Figure 9.4, and is the starting point of simulation. 

Note that random-number generators are needed to execute blocks 1 and 6 
of the flow diagram. In addition, one needs a clock to characterize passage of 
time and tables to keep track of queue lengths (block 8) and idle times of server 
(block 10) for each of the S different checkout counters. 

A typical plot of the time history is shown in Figure 9.5 for the case of a 

single counter operation. 


4 Digital Simulation Methodology 

Since, in digital simulation, the underlying formulation is being implemented 
as a program on a computer, the nature of a mathematical model suitable tor 
simulation is in general different from that for analytical treatment, 
building a model for analysis by analytical methods it is necessary to ca 
mind the constraints and limitations of such methods. When a mat e 
model is being built for simulation, no such constraint needs to e imp 
Models are built in terms of simple equations and logical expression , 
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Figure 9.4 Flow diagram for simulating Model K.2. 


organized in a way that enables a routine procedure to be used to solve them. 
Typically, they are built by dividing the system into a series of sections. Each 
section is described in a straightforward manner (using logical expressions and 
simple mathematical equations) without undue concern for the complexity 
introduced by having many such sections. 

We use the terms “simulation model” and “simulation modelling” to mean 
the following: a mathematical model developed to study the behaviour of a 
s >stem by simulation is called a simulation model and the process of building 
such a model is termed simulation modelling. As a consequence, a special 
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0 t, t 2 t 3 U t 5 t 6 

Customer arrivals 



Service times 



Figure 9.5 A simulated model behaviour for Model K..2. 


terminology has evolved for use in simulation modelling, which will now be 
described. 

4.1 Brief Introduction to Simulation Terminology 

In simulation modelling the system is viewed as consisting of various sections 
(or subsystems). Each section is described in terms of entities, and the term 
attribute denotes a property of the entity. Any process (internal or external) 
which causes changes in the system over time is called an activity. * 
initiation, alteration or conclusion of an activity is called an event. The term 
state of the system is used to mean a description of all the entities, attributes 
and activities as they exist at one point in time. The evolution of the system is 
studied by following the changes in the state of the system. 

Entities and attributes make up the static structure of a simulation model. 
They describe the state of a system but not how the system operates. This is t c 
task of activities and events. The key to simulation is the programmer s abimy 
to organize system events so that the order in which they are execut wi _... 
the computer corresponds to the order in which they would occur in the rern 
system. This preserves the time relationships between simulate an 
events. 
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In the case of Model K.3, discussed in the previous subsection, the servers at 
the counters would be called entities, customers would be called activities and 
the arrival and departure of customers would be called events. 

The simulation methodology consists of the following two steps: 

Step I: Building the simulation model. This is done by breaking the system into 
sections and identifying entities, attributes, activities and events. 

. Step 2: Writing the computer program. This converts the simulation model 
into a simulation program. It consists of: 

(i) writing a set of event programs to describe the system operating rules; 

(ii) defining lists and matrices that store information; and 

(iii) writing an executive routine that directs the flow of information and 
control. 

4.2 Classification of Simulation 

Simulation can be classified into two types: discrete simulation and cont¬ 
inuous simulation. In discrete simulation, changes in state occur at discrete 
points in time. Thus, activities are not modelled explicitly; they are represented 
by modelling the events that affect the state of the system. In continuous 
simulation, changes in state occur continuously over time; hence activities 
need to be modelled explicitly. In both cases the flow of time is recorded by a 
clock in the simulator. 

Since changes over time cannot be represented continuously on a digital 
computer, time is advanced in small increments to approximate the cont¬ 
inuous flow of time in the system being simulated. Thus, in the simulation of 
continuous systems (i.e. systems modelled by differential equations), the 
simulation transforms the differential equation formulation into a difference 
equation formulation. 

In discrete simulation, time is broken into intervals. The beginning and end 
of each interval corresponds to the occurrence of an event. These intervals are 
not necessarily equal and in the simulation of a stochastic system they are 
random variables. The flow of time is achieved by updating the clock at time 
instants at which events occur. 

4.3 Event \s Activity Oriented Simulation 

In discrete simulation, events and activities play a major role. The key element 
,s simulation algorithm, which has a structure as shown in Figure 9.6. 
Basically it involves finding the next potential event - executing if possible. 

e execution involves advancing the clock (which represents flow of time in 
me simulation) and changing the state of the system. 

n l ^ e C0m puter the state is represented by numerical values and is called the 
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Find next 

potential event 


i r_ 

Select activity 


T _ 

Test if it 

can be done 



I 


Change system state 
and advance clock 


—r~ 

Gather $toti*tic* 



Figure 9.6 Simulation algorithm for discrete simulation. 


“system image”. Changes in system image over time correspond to behaviour 
of the simulation model. 

The key factor in simulation is the control of sequences that repr 
changes in the simulation. There are two different ways of executing this 
control, which lead to two different types of simulation - event-onen w 
simulation and activity-oriented simulation. In event-oriented simulation 
emphasis is on event finding, and the simulation algorithm automatic y 
obtains the next event when all the state changes are completed. In acuv y 
oriented simulation the emphasis is on activity starting and stopping, a 
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approach is useful where an activity can start or finish only when many 
conditions are met. 

The subtle difference is that, in event-oriented simulation, the times at which 
system changes occur are treated as characteristics of the activities, while in 
activity-oriented simulation these times are treated as attributes of the entities. 
This difference is important, as some languages are based on the event- 
oriented approach while others are based on the activity-oriented approach. 
The difference is in the control of program execution, but the final simulation 
remains the same. 


4.4 Verification and Validation 

A simulation model becomes an adequate model once it is validated (i.e. passes 
the test of validation). In simulation literature, a term called “verification" is 
often used. Verification is a testing procedure (similar to validation) which 
establishes the faithfulness of the computer output to the model response. 
Thus the computer output is in agreement with the system behaviour only 
when the simulation is verified and validated - i.e. verification establishes that 
the computer output is in agreement with the simulation model behaviour, 
and validation establishes that the simulation model is an adequate represen¬ 
tation of the system. 


5 Simulation Languages 

Though almost any type of programming language can be used to implement a 
simulation model, various special languages have been developed for simul¬ 
ation applications. These languages provide the user with a variety of services; 
they are designed to ease the job of translating a simulation model into a 
program to simulate the system behaviour on a computer. They contain: (a) 
status descriptions, which are definition of the essential element of the model; 
(b) procedures for modifying the state of a model; and (c) procedures for 
controlling and observing the model behaviour. In addition to these, extensive 
error checking exists to prevent misuse of the language, as well as routines for 
debugging and locating logical errors. 

Simulation languages can be divided into three groups as indicated below: 

(i) continuous simulation languages, 

(ii) discrete simulation languages, and 
hii) combined simulation languages. 

It is not possible to discuss all the simulation languages. In the next two 
subsections we briefly review some of the more widely used languages and 
comment on their salient features. 
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5.1 Continuous Simulation Languages 

Continuous simulation languages deal with the simulation of continuous time 
systems modelled by differential equations, on digital computers in essentially 
the same way they are solved on analog computers. In a sense, a program to 
allow this kind of operation converts the digital computer to a digital-ana og 
simulator. This requires an approach similar to that used on an ana log 
computer; e.g. the user constructs functional diagrams which are similar to the 

patching diagrams used in analog computing. . 

A number of continuous simulation languages have been developed which 
offer the user great freedom in modelling a system and translating it into a 
program. They make use of statements which allow direct programming of the 
equations of a simulation model and include a variety of algebraic and logical 
expressions to describe relations between variables. We review two of the 
many continuous simulation languages to give some idea of the approach 
taken with this class of simulation language. 

5.2 CSMP (Continuous System Modelling Program) 

This is specially designed to simulate models involving ordinary differential 
equations. There is flexibility with regard to the choice of the numerical 
integration method to be used in the simulation. The language is Fortran- 
based and is constructed of three types of statements. 

(i) structural statements which define the simulation model, 

(ii) data statements which assign numerical values to parameters; and 

(iii) control statements which specify options for the execution of the program 
and the choice of output. 

The output can be displayed in tabular form and/or printed as plotted graphs. 
5.3 Case Study D [Cont.] (Rocket Launch) 

The model formulation for this problem was done in Section 5.4 of Chapter 5, 
where we discussed the solution by analytical methods. The model is given by 

x"{t) = - (k/M)x'(t) - {0 O K 2 }/MO + R) 2 ' !) 

with the initial conditions given by 

x(0) = 0 and x'(0) = v (,2) 

For simulating the model using CSMP we need to rewrite (11) as a system of 
first-order ordinary differential equations. This is done by t e tec mq 
discussed in Section 5.2 of Chapter 5. 

Define « 


y(t) = x'(r) 


(13) 
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Then (11) can be rewritten as 
x'(0 = y(f) 

yV) = - (k/M)y(t) - {g 0 R 2 }/(x(t) + R) 2 (14) 

with initial conditions given by 

x(0) = 0 and )\0) = v (15) 

* The computer program for simulating the model in CSMP is shown in 
Figure 9.7. 

We briefly comment on the various statements of the program. 

(i) The first statement defines the title. 

(ii) The next two statements define the equations of the model. 

(iii) INTGR L (XO, XDOT) is a statement to integrate x starting with an initial 
value XO. 

(iv) CONST defines the numerical values for the parameters and the initial 
conditions. 

(v) TIMER defines the parameters for integration. The terms used stand for 

DELT: Integration interval 
F1NTIM: Finish time 
PRDEL: Interval at which to print results 

(vi) PRINT defines that both X(t) and Y(t) are to be printed. 

(vii) PARAM is a statement to repeat the simulation with different values of 
V'O as indicated. 

5.4 DYNAMO 

In simulating with this language the model is broken down into a number of 
reservoirs, or “levels", interconnected by flow paths. A level represents the 

TITLE ROCKET LAUNCH 
XDOT = Y 

YDOT = -(K/M)* Y — GO* R** 2/(X + R)**2 
X = INTGRL(XO. XDOT) 

Y = INTGRL (YO, YDOT) 

CONST K = , M = , G0 = 

R = , XO = , Y0 = 

TIMER DELT = 0.1, FINTIM = 500. PRDEL = 10.0 
PRINT X. Y 

PARAM Y0 = ( % . , . ) 

END 

Figure 9.7 Program of simulating Model S.l using CSMP. 
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accumulation of an entity, and corresponds to the accumulated difference 
between the input and output Hows for that entity. The instantaneous nows are 
represented by "rates”. The rate is related to the level by the expression 

Rate = level/delay 

where the "delay” is a characteristic time. Thus, if x(l) represents the level of an 
entity and its rate be given by 

dx(f)/df = -x(t)/T 06) 

then T represents the “delay”. The smaller the value of T (or delay), the greater 

is the rate, and vice-versa. ...... 

As in CSMP, output can be in the form of tables and/or printed plotted 

graphs. 

5.5 Discrete Simulation Languages 

In Section 4.3 we discussed two ways (event-oriented and activity-oriented) of 
implementing discrete simulation. Two other ways of implementation are 
process-oriented and transaction (low. Unfortunately, the terminology is not 
consistent. As mentioned earlier, there are a large number of simulation 
languages; each uses one of the above-mentioned four ways for implement¬ 
ation of the simulation. The most commonly used languages are GPSS, 
SUBSCRIPT, GASP and SIMULA. 

5.6 GPSS (General-Purpose System Simulator) 

This is a process-oriented-type discrete event simulation language. It is a block 
diagram language. A number of basic block types are defined, each represent¬ 
ing a particular action. The user must describe the activities of the system in 
terms of these basic block types. Permanent entities are identified with 
predefined concepts such as facilities, storages and logic switches. The 
sequence of activities is established by a block diagram indicating how the 
blocks are interconnected and, where necessary, the conditions for a choice 
between alternative paths. With this the GPSS is able to establish a system 
image as a set of well-defined tables. Finally, the gathering of statistics is 
controlled by certain block types together with internal routines. A fixed 
report generator gives the desired statistics. The language allows for editing 
the output with explanatory comments. 

5.7 Case Study K [Cont.] (Supermarket Operation) 

The model building for this problem was done in Section 6.5 of Chapter 7, and 
in Section 3.3 we briefly discussed the simulation of Model K.2. for 

In this section we give the program to simulate Model K.2 using 
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5=1, i.e. there is only one checkout counter. The simulation for S > 1 is 
slightly more complicated as it requires more conditional testing. We assume 
that the interval arrival time for customers is a random number uniformly 
distributed between 70 and 130, and that the customer service is also a random 
number uniformly distributed between 50 and 90. 

As mentioned earlier, GPSS is a block diagram simulation language. The 
first step in the simulation is to describe the model in terms of the basic blocks. 
This is done in Figure 9.8, which shows the flow chart as well as the GPSS 
block diagram for the model. The GPSS program is given in Figure 9.9. 



Figure 9.8 Flowchart and GPSS block diagram for Model K.2. 
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TIME 


GENERATE 

100.30 

0) 

QUEUE 

LINE 

(2) 

SEIZE 

COUNTER 

(3) 

DEPART 

LINE 

(4) 

ADVANCE 

70.20 

(5) 

RELEASE 

COUNTER 

(6) 

TABULATE 

XTIME 

(7) 

TERMINATE 

1 

(8) 


Figure 9.9. Program for simulating Model K.2 using GPSS. 


We briefly comment on some of the statements of the program. 

(i) The program uses eight different block types (as shown in the right half of 

Figure 9.8). . f ... 

(ii) The queue formed is labelled “LINE" and the serv.ce facility 

“COUNTER". 

(iii) GENERATE creates inter-arrival times. 

(iv) ADVANCE creates service times. 

(v) SEIZE and RELEASE modify the status of service facility. 

(vi) TABULATE is for accumulation and printing of information and the 
table used is labelled XTIME. 


5.8 SUBSCRIPT 

SIMSCRIPT is an event-oriented simulation language. In simulating with 
S1MSCRIPT each type of entity is individually defined by giving names to 
entities and to their attributes, and specifying their data structure. The 
activities are represented by individually written routines (called event 
routines) written in the SIMSCRIPT language. The linking of activities is 
carried out by event notices, which are also individually defined by the user. 
Each event notice corresponds to an event due to occur at a particular t ' n ’ e - 1 
identifies the activity (event routine) to be activated by an event notice, and the 
user must create further event notices for scheduling follow-on events. A 
separate form is used to describe the output required from the program, 
allowing extensive editing of the results. 

5.9 GASP 

This is a combined simulation language i.e. it allows for simulating models 
where changes occur continuously with time as well as occurring at ls< ; 
points in time. Changes occurring discretely are described throug en 1 
associated variables and sets which are referred to as a file. Changes oucu g 

continuously arc described through state variables. The dynamic e av 

described by means of two types of events - time events and state ev f n s . 
former are scheduled to take place at future points in time, while the 
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occur whenever a state variable meets some prescribed condition. A user- 
written routine needs to be supplied for each type of event. In addition, GASP 
offers a range of packages for data gathering and reporting. 


5.10 SIMULA 

This is a process-oriented type discrete event simulation language. The model 
to be simulated is viewed as consisting of a collection of processes. The actions 
and interaction of the processes comprise the dynamic behaviour of the model. 
The process is an entity which can exist in one of four states - active, 
suspended, passive and terminated. For each process the user must supply a 
routine that describes the process. In SIMULA, future processes are 
sequenced in a sequencing set according to their time attribute. The language 
also provides packages for the collection of statistics. 


6 Monte Carlo Method 

In Chapter 6 we discussed analytical and numerical techniques for solving 
deterministic formulations. An alternative technique, useful in solving certain 
types of deterministic formulations, is one which can be termed ‘'experimental 
mathematics”. A properly designed experiment using random numbers is 
carried out many times on a computer. The outcome (a random variable) of 
each experiment is noted. The solution to the deterministic formulation is 
given in terms of an estimate based on these outcomes. The estimated solution 
is a random quantity, characterized by mean and variance. If the experiment is 
properly designed the estimated solution has mean equal to the true solution 
and a small variance, thus making it meaningful. In general, the variance goes 
down as \/N where N is the number of times the experiment is repeated. Hence 
by choosing a large N the estimated solution is fairly close to the true solution. 

The technique is applicable only when a suitable experiment can be devised 
so that the outcome of the experiment can be related to the true solution. This 
technique is known as the Monte Carlo Method. 

We illustrate by giving two examples. 


Example 1 

The problem is to evaluate the integral 

/ = J/(x)dx (17) 

where/(x) is a specified function. For simple forms of /(.*) the integral can be evaluated 
analytically. For any arbitrary form of f(x) the integral can be evaluated by one of many numerical 
methods. A Monte Carlo method to obtain an estimate of / is given by the following algorithm. 

U) Generate a sequence l/ f , 1 ^ i ^ N, from a uniform distribution over [0,1] 

U Compute = a + ( J.(b — a), 1 ^ i N . 

U> Compute /(*,), 1 ^ i ^ N. 
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(4) Compute the estimate / by 


/=(h-a)| 


(18) 

Then it can be shown that 



£[/] = / 

(19) 

and 

r 

Cb ~\ 1 


var[/]= ( b-a) 

L 

/ J (x)dx-/ 2 N 


indicating that the variance ->0 as N -*cc. 

J. J 1 

V __ " \ 4-- t C’ .i rlrv mat and 

(20) 

■ C nnA rtf mon.i 


The above method is called the “Simple-Mean” Monte Carlo method and is one of many 
methods for evaluating such integrals. 


Example 2 

Consider the parabolic equation 

d/dt{U{xj)} ={\/2)d 2 /dx 2 {a 2 {xj)U(x % t)} +d/dx{P{xj)U(xj)} (21) 

with the I.C. and B.C. given by 

I.C.: U(x,0) = 6(x) B.C: U(x,f)-0 as |x|-oo (22) 

This equation can be easily solved by numerical methods. We will illustrate the use of the Monte 
Carlo simulation technique to obtain the solution U(x,f) for some specified t = t*. 

Consider a stochastic differential equation given by 

d2(0 = 0(Z, f>dr + ot(Z, t )d W(f), Z(0) = 0 (23) 

where W(t) is the standard Weiner process. From Section 8.1 of Chapter 8 we have that the 
probability density function piz,t) of Z(f) satisfies the equation 

d/dt{p(zj)} =(l/2)r? 2 AV{a 2 (r,f)p(z,f)} + d/dz{P(z,t)p(zj)} (24) 

with 


I.C: p(z,0) = £(z) 

B.C.: p(z, t) = 0 as z -> ± oc (25) 

Thus we see that p(z,f) is identical to U(xj). Hence once p(zj*) is obtained, we have the desired 
solution U{xj*): p(z,t*) can be obtained using Monte Carlo simulation as follows. 


The stochastic differential equation is simulated until time t = t* starting with an I.C., Z(0)=0. 
The value of Z(f*) is noted; let this be denoted by z,. The experiment is repeated N times to 
yield 2 2 ,z 3 ,...z„ where z, is the value of Z(f*) for the ith experiment. From histogram plot of 
the observed values an estimate of p(z,f*), and hence of U(x,f*), can be obtained. The larger 
the value of N, the smaller the uncertainty - in other words, the more accurate the estimated 
solution. 


Notes and References 

Items [l]-[5] show a small sample of the large number of books on 
simulation modelling and analysis. Most simulation books discuss the 
different aspects of simulation - viz. modelling, verification, validation etc. 
Item [6] develops a formal theory for simulation modelling and [7] is an 
example of the large number of books dealing with statistical aspects o 
simulation. 
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Discrete system simulation has received more attention than continuous 
system simulation. For further details of simulating continuous systems see 
[8]; of CSMP, see [9] and of DYNAMO, see [10]. 

For more on discrete system simulation see [11]. Details of specific 
simulation languages can be found in many books devoted to the topic; [12]- 
[17] are a small sample. 

Our presentation of Monte Carlo methods has been very brief. Monte Carlo 
methods have been used for solving a variety of mathematical formulations. 
For further details, see [18] and [19]. 

The list of books on simulation models in specific disciplines is very large 
and [20] [25] are a small sample. 
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CHAPTER 10 


Parameter Estimation 


1 Introduction 

We have used the term "parameter” in a number of ways in earlier chapters. In 
Chapter 3 we used it in the context of system characterization. To clearly 
distinguish this from others, we will designate such parameters as system 
parameters”. In Chapters 5 and 7 we introduced parameters in the process of 
developing mathematical formulations. These parameters will be called 
"formulation parameters". When a formulation is used to model a system, the 
"formulation parameters" become “model parameters . 

For a given mathematical model, one needs to solve the underlying 
mathematical formulation to obtain the model behaviour. As indicated in 
Chapters 6 and 8, specific numerical values must be assigned to the model 
parameters to obtain the solution. The problem of finding the most 
appropriate values is called "parameter estimation . Different approaches 
must be used, depending on the nature of the underlying mathematical 
formulation. In this chapter we discuss methods of estimation for both 
deterministic and stochastic models. 


2 Some Preliminaries 
2.1 Notation 

We shall denote by 9 the unknown model parameters which need to be 
estimated; 0 can be either a scalar or a vector. In general 9 is constrained to 
belonging to a set © specified as a part of the model formulation as illustrated 
by the following two examples. 

Example I 

Consider a model involving a time-invariant linear differential equation formulation 

i=i 

where >■"'(/) is the jib derivative of > 10 ; x (0 and >10 representing the model input and output 
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respectively. The parameter 0 is given by an n-dimensional vector 

0 r =!» |.*2 .*.! ( 2 ) 

As a part of model formulation one might specify that bounded input must produce bounded 
output; in other words, the model must be stable. This is assured if the parameters are suitably 
constrained so as to ensure that the roots of the polynomial equation 

n 

Ph) = s" + £ a,.s J "-•' = () (3) 

i= I 

lie in the left half-plane; i.e. they all have negative real parts. In the case n - I, the constraint is 
given by 

0:{ot >0} (4) 

For n > 1 it is given by a system of algebraic inequalities. 


Example 2 

Consider a model involving a stationary Markov chain formulation with two states. In this case 
the formulation is given by a 2 x 2 one-step transition matrix 


i -\1" M 

(5) 

L^ 2 l P22J 

with the constraint set 0 given by 

I. 1 ^ i. 7 ^ 2 . and 

2 

IP„=I. I<i <2 

( 6 ) 

i- 1 

As indicated above wc have four parameters - i.e. 

0 T -\Pu*Px»Pix ^ 22 } 

(7) 

An alternative, but equivalent, characterization is as follows; 

r'-'■■•l 

( 8 ) 

Ll-P,, />„ J 

with 

0 r =\P„.P 12 1 

(9) 

and 

(10) 


The advantage of the second characterization is that it has fewer parameters, as well as fewer 
constraints. 


We shall employ the following notation throughout this chapter: 

0: unknown parameter, 

true value of 0, 

0: estimate of 0 obtained using the system data, 

h: dimension of 0, 

y n >x m : model output and input respectively, 

y\x*: system output and input respectively. 

(Note y #n ,x m ,y s and x 5 can be scalar or vector.) 

In the case of a static system [model], y*> x s [y m , x m ] are variables; in the case of 
a dynamic system [model] y 5 , x 5 [y m , x m ] can be either sequences or a function 
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of one or more independent variables representing time and spatial 
coordinates. 

T[N~]: Length of data available for parameter estimation with continuous 
[discrete] time data. 

2.2 Estimation 

In many instances the model parameters are related to the system parameter 
on a one-to-one basis. In this case it is sometimes possible to obtain estimates 
for some of the model parameters by directly measuring the corresponding 
system parameters, as illustrated by the following example. 

Example 3 Case Study D [Cont.] (Rocket Launch) 

A model for the motion of the rocket was given in Section 5.4 of Chapter 5. It involved a second- 
order differential equation formulation. One of the model parameters is M , the coefficient of the 
second derivative. It represents the mass of the rocket. Thus, in this case, we can obtain the true 
value of M by measuring the mass of the rocket directly. 

Often it is not possible to measure system parameters directly. In the case of 
“black box” modelling, the system parameters are not defined, and hence no 
such one-to-one relationship exists between the system parameters and the 
model parameters. In these cases the model parameters need to be estimated 
using the system input and output data. 

A scheme for estimating parameters is called an estimator and the values 
obtained are called estimates. 


2.3 System Data 

The system data (available for parameter estimation) are given by the observed 
values of the input and output variables in the case of a static system, and by 
their time histories in the case of dynamic systems. The data available are often 
less than complete, and in many instances contaminated by external factors, as 
illustrated by the following example. 


Example 4 

Suppose that the changes in the system are occurring continuously in time. In this case complete 
data would correspond to measuring the variables continuously in time. Often it is not possible to 
do this. Suppose the variables are measured using a sampling scheme which records the values at 
discrete time instants. In this case, instead of complete data we have only partial data, as some 
information is lost due to sampling. Since a variable is measured using some physical device 
(sensor), the measured value will differ from the true value due to what is commonly termed 
“measurement noise" (or error) which affects the measurement. If the magnitude of this error * 
small in relation to the magnitude of the variable being measured, then it can be ignored amltn 
measurement can be viewed as being nearly accurate. If not, the effect of this error must be taK 
into account and the data treated as inaccurate. 

The method for estimation depends on type and quality of data available for 
estimation. 
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2.4 Deterministic vs Stochastic 

In deterministic systems there is no uncertainty. Unfortunately, systems are 
seldom deterministic, but if the uncertainty is small then it can be ignored and 
the system can be viewed as a deterministic system and modelled adequately 
by a deterministic formulation. In this case the model parameters can be 
obtained using a relatively small length of data, and the estimates can be 
viewed as being deterministic. 

In stochastic systems the system variables change in an unpredictable 
manner. As a consequence an estimate based on finite length of data is also 
random. It can be characterized either through a probability distribution 
function or in terms of its moments. As the data length increases, the estimates 
might or might not converge in some probabilistic sense. If the estimates 
converge to the true value as the data length approaches infinity, then the 
estimates are said to be a consistent estimate and the estimator a consistent 
estimator. The convergence can be one of many types - e.g. convergence in 
probability, convergence in mean square or convergence with probability one. 
Convergence in mean square sense implies that as the data length approaches 
infinity, the mean of the estimate equals the true value and the variances of the 
estimate go to zero. 

A variety of estimators has been developed for estimating the parameters 
of different types of stochastic models. An estimator is said to be unbiased if the 
mean of the estimate equals the true value. Obviously, the estimator with the 
smallest variance is the most desirable. For the case where the data length is 
sufficiently large, the maximum likelihood estimator has this property. We 
shall discuss this aspect further in a later section. 

Comments 

(1) We use the term “model parameter” in the widest sense. For example, it can 
include the order of the formulation, as in an autoregressive moving- 
average time series formulation; the parameters of the distribution 
characterizing uncertainty in the system and/or measuring device and so 
on. 

(2) In Chapter 4 we discussed a model as a mathematical formulation related 
to a system characterization. To obtain model behaviour we dropped the 
association between the formulation and the system characterization. This 
allowed the analysis of the formulation to be carried out without the 
encumbrance of the system characterization. The behaviour of the model 
was obtained by re-establishing the relationship after the analysis was 
completed. We employ a similar approach here. As such, when we use the 
term model, we refer to the underlying mathematical formulation and 
ignore the relationship with the system characterization. Hence, estimat¬ 
ing model parameters is really treated as estimating the parameters of the 
underlying formulation. 
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3 Deterministic Model Parameter Estimation 

In Chapter 5 we discussed a variety of mathematical formulations suited for 
use as deterministic models. Although the structures of the formulations vary 
considerably, the approaches to parameter estimation are nearly the same for 
all. The two basic approaches to parameter estimation are: 

(1) equation error approach, and 

(2) output error approach. 

In this section we commence with a restatement of some of the model 
formulations discussed in Chapter 5. This is followed by a brief discussion of 
data aspects in estimation. The equation error and output error approaches 
are then discussed. 


3.1 Restatement of Formulations 

We use the notation mentioned in the previous section - i.e. x represents the 
input and y the output. 

A representative static model is given by 

y m = f(x m ,0) (ID 


and the data available for estimating the parameter 0 are assumed to be given 
by 

{xj.y?}, ( |2 > 

A representative discrete time dynamic model is given by 

yr = g(A[yn„A[xn l '0) < 13 > 

where [A*,L = {W," 4) 

and the data available for estimating the parameter 0 are assumed to be given 
by 

{x?,y?}, l^f^T (15) 

If xj and/or /, is missing for some values of t, in the interval 1 < f ^ T, then we 
have incomplete data. 

Similarly, a representative continuous time dynamic model involving 
ordinary differential equation formulation is given by 

g{DWMn>Dtx m ('M = 0 

where 

D[z(r) ]„ = {z(t), z' 1 '(f), z' 2) (0,. • •, z'"'(f)} 

and 

2 0) (t) = d‘z(f)/df‘, iSsO 


(16) 

(17) 

( 18 ) 


PARAMETER ESTIMATION 


181 


The data available for estimating the parameter are given by 

{/(f),x 5 (r)}, OsSrsST (19) 

If the data available for estimation are given by 

Mf),x s (f)}, r = t„t 2 . t N (20) 

instead of (19). then we have reduced data as opposed to complete data. The 
reduction is due to sampling. 

We assume that 0 is an /i-dimensional vector and constrained to satisfy OeQ, 
a specified set of constraints. 

3.2 Equation Error Approach 

The method is shown schematically in Figure 10.1. The observed values of x 5 
and y s are substituted in place of x m and y m in the modei formulation (given by 
(II) for static model; (13) for discrete time dynamic model and (16) for 
continuous time dynamic model). As a result, the formulation can be viewed as 
an algebraic equation with 0 being the only unknown. If the number of 
algebraic equations so created equals n, then these can be solved to yield an 
estimate 0. 

When the data available are large, then the number of algebraic equations 
resulting from the above-mentioned substitution is greater than n. If the model 
is a true representation of the system, the set of algebraic equations form a set 
of consistent equations i.e. solving any n equations will yield the same 
estimate 0 = 0*. However, a model is only a simplified representation of the 
system. As a result the set of algebraic equations do not form a set of consistent 
equations. In other words we have an overdetermined set of inconsistent 
equations. In this case the estimate (1 is obtained by a least-squares approach 
which minimizes the sum of the square of errors in the set of algebraic 
equations. 



Figure 10.1 Equation error approach. 
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A major drawback of the approach for estimating continuous time dynamic 
model parameters is that it requires differentiating the system data. This is not 
desirable as often the data are contaminated with sensor noise which gets 
amplified under the operation of differentiation. An alternative method 
suitable for continuous time linear dynamic models, and not requiring any 
differentiation of the data, will be discussed in Section 3.4. 


3.3 Output Error Approach 

The method is shown schematically in Figure 10.2. In this approach the model 
response (i.e. y m ) is obtained for 0 — 0 using .v m = x s . Let c(0) denote the error 
between the model output and the system output — i.e. 

e{9) = e,(0) = f, - y7> (21) 

in the case of discrete time model, and 

e{0) = e(t , 0) = /(f) - 0 < t T (22) 

in the case of continuous time model. 

Based on e(0), we evaluate a function J{9) given by 

J(9) = t {e,(0)} 2 /H < 23 > 

1 “ 1 

in the case of discrete time model, and 



in the case of continuous time model. 



Figure 10.2 Output error approach. 
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J(6) is an indicator of the misfh between the system and the model outputs 
with the model parameter 9 = 0. The estimate 0 is the value of 9 which 
minimizes J(0). It can be obtained by using one of many iterative schemes 
available for function minimization subject to constraints, as 9 is constrained 
to belong to ©. 

3.4 Special Method for Linear Dynamic Model 

This method is applicable only for models involving linear ordinary different¬ 
ial equation formulations with time-invariant parameters. The parameters of 
the model are the coefficients of the differential equation. If the model input be 

x m (f) = sin(a>r) (25) 

then the output under steady state conditions is given by 

y m (r) = A m (a>,0) sin (cut + <f>(w,9)) (26) 

where A m (a),0) and <^(co,0) are functions of the frequency m and model 
parameter 9. (They are called “gain” and “phase" respectively.) By selecting 
x J (f) = x m (f) as given by (25) we can experimentally measure the resulting 
system ouput /(r) and hence obtain the system gain A s ((o). Equating the model 
gain with the system gain i.e. 

A\a)) = A m (u>,0) (27) 

we have an algebraic equation with 9 being the unknown. By obtaining A*(io) 
for different values of co, we have n algebraic equations which can be solved to 
yield an estimate 9. We illustrate by a simple example. 

Example 5 ( First-order Model) 

The model is given by 

dy"(r)/df = ay m {t ) + bx m {t) (28) 

with the parameter 0 and the constraint © given by 

9={a % b }, 0 = {a<O,h*O} (29) 

Using x"(r) given by (25) we have 

A m {a), 0 ) = b/y/(a 2 + a) 2 ) (30) 

Thus, by experimentally evaluating A\oj) for two different values of <u, and using (27) with A m (io s 0) 
given by (30), we can solve for a and b. Note that the solution yields two values for a - one negative 
and one positive. The negative is the one to be accepted as the positive violates the model 
parameter constraint given by (29). 

Case Study A [Cont.] ( Weight Lifting) 

In Section 3.2 of Chapter 5 we discussed various models for relating the weight 
1 ^*h e body weight (W). In this section we consider estimating model 

parameters for Models A.2 and A.3. 
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Model A.2 


The model is given by 


L = kW 2li 


(31) 


with 0 and 0 given by 

e = {k}, @ = {k> 0 } (32) 

Using the data given in Table 2.2 and the equation error approach, we have 

k= 18.49 (33) 

Model A.3 

The model is given by 

L = k(W-W 0 ) l > (34) 

with 6 and 0 given by 

0 = {k,W o Jl Q = {k>0,W o >0J>0}. (35) 

We assume W 0 = 30. Then, using the data given in Table 2.2 and the equation 
error approach, we have the following estimates: 

fc = 64.256, P = 0.44335. (36) 

3.6 Case Study E [Cont.] (World Population) 

1 n Section 5.5 of Chapter 5 we discussed a variety of models for the evolution of 
N(t) y the total population at time t. In this section we consider estimating the 
model parameters for Model E.5. 


Model E.5 

The model is given by 

dN(t)/dt = f(N(t)J)N(t) (37) 

We consider a specific form for / as indicated below: 

f(N(t),0) = a(L-N(t)) (38) 

with the parameters and the constraints given by 

0={a,L}, 0 = {L > 0} (39) 


We assume a value for L and estimate a using the output error approach and 
the data given in Table 2.3. The estimated values are as follows. 

(i) L= 15 x 10 9 d = 2.307 x 10' 3 

(ii) L = 20 x 10 9 <3 = 1.722 x 10“ 3 . 


(40) 
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4 Stochastic Model Parameter Estimation 

In Chapter 6 we discussed a variety of formulations suited for use as stochastic 
models. A number of methods have been developed for estimating stochastic 
model parameters. Some of the methods are applicable for all types of models 
whilst others are applicable to specific models. In this section we discuss the 
three commonly used methods. They are: 

(i) method of least squares; 

(ii) method of moments; and 

(iii) method of maximum likelihood. 

We employ the notation defined in Section 2 and the standard convention of 
using upper-case letters to represent random variables and lower-case letters 
to represent their outcomes or realized values. 

4.1 Method of Least Squares 

The method of least squares is ideally suited for estimating parameters of 
regression models. As indicated in Section 4.3 of Chapter 7, a regression model 
is of the form 

>7 = 0(xr,0)+ VT (41) 

where V™ is an unknown random variable with zero mean and finite variance. 
V, is random due to VT being random. 

Let the system data available for estimating 0 be given by 

(xlrf), 1</<N (42) 

(Note: is the realized outcome of TJ which is random due to uncertainty in 
the system.) 

The estimate 0 is obtained by minimizing J(0) given by 

m= t {fi-gixlW/N (43) 

1 = 1 

i.e. 9 is the value of 0 which yields a minimum value for J(9). Note that the 
parameters are obtained by viewing yf as a realization or outcome of the model 
with x" = xj. Because y? is a random outcome, the estimate 9 is a random 
variable. If the sequence {K,} is an independent sequence, then as N-* oo, 
d-*9* in probabilistic sense as defined in Section 2.4. 

The method of least squares is also suited for estimating deterministic model 
parameters when the observations are contaminated with sensor noise. We 
shall assume that the noise is additive. As a result the sensor output Y s 0 is 
related to the true output y 1 by a relation 

Y s 0 = y°+V (44) 

where V is an independent stochastic process with zero mean and finite 
variance representing the sensor noise. 
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The error between the observed output yj> and the model output is given by 

e(0) = /o - < 45 ) 

Since yj, is an outcome of the random variable Y s 0 , minimizing the sum of error 
squares using J(0) as defined by (23) or (24) yields an estimate which is 
random, and converges to the true value 0* as N -* oo. However, J(9) does not 
go to zero as N -* oo. It approaches a limit a 1 , the variance of the sensor noises. 

If the uncertain term Fin (41) or (45) is correlated (as opposed to being 
statistically independent) then the estimate 0 might not converge to 6* as 
N — * oo. In this case the method has to be suitably modified to take the 
correlation into account. 

4.2 Case Study J [Cont.] (Speed of Aeroplanes) 

In Section 4.4 of Chapter 7 we proposed three stochastic models involving 
regression formulations for this problem. In this section we consider 
estimating parameters for Models J.l and J.2(b). 

Model J.l 

The model is given by 

r"(r) = a,+ 6,(r-1900)+F^f) (46) 

with 

0 = {fl,A} (47) 

Using the method of least squares and the system data, for the period 
1900-1940, given in Table 2.6, we have the following estimates: 

d, =47.07, 0! = 11.35 (48) 

Model J.2 (b) 

The model is given by 

log r"(t) = a 3 + b 3 (t -1900)+ F"(r) (49) 

with 

0 = {a 3 ,b 3 j (50) 

The estimates using the method of least squares and the above-mentioned 
data are as follows. 

d 3 = 3.75, b 3 = 0.053 (50 

4.3 Change in Notation 

We depart from our earlier convention where we used superscript m to indicate 
model variables - e.g. and y m referred to the random output and its realized 
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value. Henceforth we shall drop the superscripting for model variables and use 
Y and y in place of Y m and y m . However, we shall continue to use superscript s 
for the system variables, as in the past. 

4.4 Method of Moments 

In the method of moment the parameters are obtained by the following three- 
step procedure. 


Step 1: Obtain expressions for the moments of Y in terms of the model 
parameters. Let Mj , theyth moment of F, be related to the model parameters 
0 by the function g/9), i.e. 


Mj = Em = gj(9), j>\ 

(52) 

Step 2: Obtain estimates of Mj using the system data available. Let the 
available data be given by 

{yf}, IsS/sSN 

(53) 

Then the estimate M, can be obtained as follows. 


II 

2: 

(54) 

Step 3: Obtain 0 by solving (52) with M, used instead of M } , i.e. 
equation 

by solving the 

90) = Mj, 1 <y < n 

(55) 

The method yields consistent estimates if Mj converge to the true 
as N -+ oo. 

In the case of stationary stochastic process formulation one needs moments 
involving the variable at two different time instants. For example, one might 
use 

Vj=E[Y i Y i+J ] = h J (0) 

(56) 

in the case of discrete time formulation, or 


v(t) = E[Y(t)Y(t + t)] = h(x) 

(57) 

in the case of continuous time formulation. The estimates for 
given by 

Vj and v(t) are 

V; = lW + y)/(/V ~j) 

i = 1 

(58) 

and 


V(0/U + T)}dr/(T-t) 

(59) 
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As the data length available for estimation (i.e. T or N)-+ oc the estimate 
0-.0* if Vj and v(t) converge to the true values Vj and v(r) respectively. 

Note that, as in the previous method, the parameters are estimated by 
viewing the system output as a realization of the model output. 

4.5 Case Study / [Cont.] (Component Reliability) 

In Section 6.7 of Chapter 7 we proposed two models for modelling the time to 
failure. In this section we will obtain estimates of the parameters for Model 1.1. 

Model 1.1 

Let the time to failure (or equivalently, the age at failure) for the ith control 
element be V,. We assume that Y„ 1 < / < n is a sequence of independent and 
identically distributed random variables with a distribution function F(y ,A) 
given by 

F(y,A) = 1 —e~ yix (60) 

In this case the parameter to be estimated, and the constraint, are given by 

0 = {A}; 0 = {A^O} (61) 

Since 0 is a scalar, we need to obtain an expression for one moment only. We 
shall use the first moment M„ which is given by 

M,(/.) = £{y} = A (62) 

Using the data given in Table 2.5, from (52) we have 

Mi = 1.561 (63) 

Using M,(/)= Mi, we have the estimate A given by 

A = 1.561 (64) 

4.6 Method of Maximum Likelihood 

Let f(yi,y 2 ,...,y N \ 0) represent the joint density function for the set of 
random variables (J^, Y 2 ,...y Vjv) of the model under consideration. Let 
y., 1 ^ i ^ N, be the observed (or realized) values of Y t . The likelihood function 
expresses the likelihood of this set of values being the outcome, and can be 
written as 

m = L(y.. y N \0) = f(yi,y 2 ,...,y N \6) (65) 

Given y„ 1 < / < N, and the model structure, the only unknown in L is 0, the 
parameters to be estimated. 

In the method of maximum likelihood the estimate 0 is obtained by 
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maximizing L(0) given by (65) with y h I ^ K N y being replaced by the 
corresponding system data yj. In other words, 0 is the model parameter which 
maximizes the likelihood of the observed data, yj, 1 < i ^ N, being the random 
outcome (or realization) of the model. 

0 can be obtained either by maximizing L(0) directly or by solving the set of 
equations given by 

dL(9)/dO = 0 (66) 

For a given system data (i.e. y-, 1 < i ^ /V), the 0 which maximizes L(0) also 
maximizes log L(0), as the logarithmic transformation is monotonic. In some 
cases it is simpler to deal with log L(0) instead of L(0). 

The maximum likelihood estimate has some very desirable and interesting 
asymptotic properties which are as follows: 

(i) 0 is asymptotically unbiased; 

(ii) the covariance of 0 is given by 

cov(0) = M~ l (67) 

where 

M = M(9*) = E[ {d log U9)/d9] T {d log U9)/d9) ] \ §me . (68) 

and the expectation is done over the observed random variables. A/(0*) is 
called the Fischer information matrix and will be used in the next chapter; 

(iii) 0 is normally distributed with mean equal to 0* and covariance as given in 

(ii); 

(iv) no other estimate has covariance smaller than M “ *. In other words, the 
maximum likelihood estimator is the most efficient estimator. 

The form for L(0) depends on the model formulation and the data available for 
estimation, as illustrated by the following examples. 

4.7 Case Study / [Cont.] (Component Reliability) 

We discussed the estimation of parameter for Model 1.1 in Section 4.5, where 
we considered the method of moments. In this section we discuss the method of 
maximum likelihood. 

Since the failure times Y h 1 < * ^ N, are a sequence of independent and 
identically distributed random variables with a distribution function given by 
(60), we have the likelihood function L(/) given by 

^)= fi /m 

i« 1 

= Z (l/^) N exp | - £ yJX j 


(69) 
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The maximum likelihood estimate, X, is given by the / which maximizes L(A) 
with 


y. = yj, 

Rather than deal with L(A) we shall consider its logarithmic transformation. 
As such, we have from (69) 


log LU) = - N log A - £ (yf)M 

i = 1 

where we have replaced y, by yj. X is obtained by solving 

d{logLU)}/d;. = 0 


(70) 


(71) 


which yields 


X=£ yj/w 

i= 1 


Using the data from Table 2.5, we have 

X= 1.561 


(72) 

(73) 


Note that for this model the estimate obtained by the method of maximum 
likelihood is the same as that obtained by the method of moments. In general, 
this is not the case. 


4.8 Case Study G [Cont.] ( Weather Changes) 

In Section 5.3 of Chapter 7 we proposed a model (Model G.l) involving a 
stationary Markov chain formulation with three states for modelling the 
changes in weather. In this section we consider parameter estimation for this 
model. 


Model G.l 

This model is given by a one-step transition matrix P, which is as follows: 



Pi i 

Pl2 

Pi 3 

p = 

Pu 

P 22 

P22 


Pu 

P 22 

P> 3 

0 = 

V/ 

■—t 

oT 

<3} 


(74) 

(75) 


and 

0 = {Os: p u 1 for 1 ^ ij < 3, and £ P tJ =1 for 1 < 3} (76) 

i = 1 

The variable Y, can assume only one of following three values: 1, if day t is 
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sunny; 2, if partly sunny - and 3, if cloudy. The system data the realized 
values yj, 1 ^ t 63; are as given in Table 2.4. 

The above situation corresponds to complete data being available. Let n s tj 
(1 < i.j < 3) be the number of transitions from state i to j over the interval of 63 
days. Then, from the data given in Table 2.4, we have n-j given by 

18 9 5 

9 1 6 (77) 

6 5 3_ 

with the element in the /th row and yth column representing n*j. 

To obtain the maximum likelihood estimate we need to obtain the joint 
distribution function for {N,j, 1 s? i,j < 3}, where N u represents the number of 
times the state has changed from i to j over the interval of 63 days. We omit the 
details of the derivation and present the final form of the estimator. The 
estimates are given by 


p ij = 


(78) 


for 1 < i, j < 3 and where 


"?= I < 


(79) 


Using the values given in (77) we have the maximum likelihood estimate P of P 
given by 


18/32 9/32 5/32 
P= 9/16 1/16 6/16 
6/14 5/14 3/14 

Note that the estimates satisfy the constraints given by (76). 


(80) 


4.9 Case Study L [Cont.] (Demand for Soft Drinks) 

In Section 7.4 of Chapter 7 we proposed a model (Model L.l) involving 
autoregressive moving-average formulation for predicting the demand for soft 
drinks. In this section we outline the method of maximum likelihood to obtain 
the model parameters. 


Model L.l 

The model is given by equation (53) of Chapter 7 and is reproduced below. 

Y '= t «jY-l+ 1 PjW,-}+W, (81) 

j =i j =i 


with 


0 = {a y , a 2 } 


(82) 
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and 

0 = {roots of P(s) = 0 and Q(s) = 0 lie inside the unit circle} (83) 

where 

P(s) = s n - X ( 84 ) 

j=\ 

Q(s) = s n + £ 

j= i 

The system data available for estimation are given by 

{y?}, 1 *5 f < N. (85) 

Recall that W„ t 5s 1 is a sequence of independent and identically distributed 
random Gaussian variables with mean zero and variance a 2 . 

Since Y„ I 2* 1 is Gaussian, the joint density function for (T„..., Y N ) can be 
obtained but is a complicated function of 0. An approximate maximum 
likelihood estimate, 0 , is obtained by minimizing: 

J(0) = £ {*M 2 /(N-«) (86) 

* = »+ 1 

where w k is generated by the relation 

** = yl - £ -i-£ PA-j (87) 

>=i i 

where w k =0, I k s$ n. The error between the true and the approximate 
estimates is of the order ( \/N) and is insignificant for large N. The estimate 6~ is 
given by the minimum value for J. 

The above minimization requires an iterative computational scheme where 
changes to the parameters must be done in a manner so as to ensure that the 
constraints imposed by (83) are not violated. We examine parameter 
estimation in ARM A models in detail in Chapter 16. 

5 Concluding Remarks 

As mentioned earlier, we have confined our attention in this chapter to the 
three most common methods of parameter estimation for stochastic models. 
Many other methods have been developed which are described in the 
literature. Specific details are given in the Notes that follow. 

Notes and References 

Items [l]-[3] comprise a small sample of the large number of books on 
parameter estimation in general. Most books on econometrics (e.g. [4]) and 
on regression modelling (e.g. [5] and [6]) deal with parameter estimation or 
regression formulations. 
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Methods for estimating dynamic model parameters can be found in [7]- 
[15]. In general, discrete time formulations have received more attention than 
continuous time formulations. Items [16]-[19] list a small sample of books 
which deal with special methods for discrete time formulations. For more on 
the equation error approach, see [20]. Parameter estimation for distributed 
systems (i.e. systems modelled by partial differential equations) can be found in 
[21 ]—[23]. 

We commented briefly on sensor noise affecting observed values. Some¬ 
times this is also called measurement error. For more on the analysis of 
measurement errors, see [24], 

As indicated in the previous section, we have omitted various methods for 
estimation. For further details on the method of quasi-linearization, see [25]; 
on stochastic approximation, see [26]; item [27] is an illustrative sample of 
books on robust statistical methods. In the text we mentioned some of the 
interesting and desirable asymptotic properties of maximum likelihood 
estimators. For discussion on small sample properties, see [28], 

Often the data available are incomplete. For more on estimation with 
missing data see [29], 

There are a large number of books devoted to estimating parameters for 
either a particular type of model formulation (e.g. [30]) or models from specific 
areas of application (e.g. [31]). The Proceedings of the IFAC Symposium on 
Identification and Parameter Estimation, held every third year since 1967, 
contain a large number of interesting articles dealing with parameter 
estimation for dynamic models. 
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CHAPTER 11 


Design of Experiment 


1 Introduction 

As discussed in Chapter 3, the first step in modelling is system characteriz¬ 
ation. This requires an understanding of the system under consideration. If 
the system has been studied in the past, and the model builder has inherited 
this understanding, then very little new information is needed to execute the 
first step in the modelling process. If this is not the case, then the model 
builder must first gain a satisfactory understanding of the system before the 
modelling process can commence. This understanding can often be achieved 
through experimentation. There are other instances where experimentation 
is necessary; for example to provide information about the system to permit 
estimation of model parameters as discussed in Chapter 10 or, ultimately, to 
validate the model. 

Experiments must be designed to yield relevant information. Optimal 
designs maximize the information obtained. The design of experiments 
depends on the type of information being sought and the reason for seeking 
that information. Thus, the design of experiment to improve understanding 
of causal relationships (needed for system characterization) will be different 
from those needed for estimating model parameters or validation. Since 
information is obtained by measuring one or more variables using sensors, 
often the information obtained is contaminated by sensor noise. Conse¬ 
quently, care must be taken in interpreting experimental results to avoid 
incorrect inferences. 

In most real-life situations the experiment must be done under one or 
more constraints. The constraints can be either technical or economic. 

xperimental design must take these constraints into account. Often a 
trade-off must be sought between the quality of information obtained and 
e time (or cost) required to obtain it. These make the design of experiment 
an interesting and challenging topic. 

oh!" thls .^ hapter we focus our attention on the design of experiments to 
Dtam additional information about the system under consideration. We 
r wit those that provide information helpful in characterizing systems, 
consider designs for static systems in Section 2 and for dynamic systems 
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in Section 3. Following this, wc discuss optimal designs for parameter 
estimation. This is done in Section 4 for static systems and in Section 5 for 
dynamic systems. In Section 6 we discuss a topic called response surface 
design”, which can be thought of as the combination of experimental design 
and optimization. Finally, we consider the design of experiments in a bigger 
framework involving additional features such as the time needed for 
experimentation, cost factors, quality of information etc. 

2 Design of Experiments for Static Systems 

(Causal Relationships) 

Suppose the model builder has identified N variables, X,( 1 < i < N). and wants 
to establish which of them influence some other variables F significantly. In 
other words, the aim is to find out which of the N variables, X,(l ^ i ^ N), 
are related to Y through a cause-effect relationship. 

In the literature on design of experiments, the variables X,(l < i < N) are 
called controlled factors. The reason for the terminology is that the 
experimenter can control them in some sense — i.e. in the selection of either 
the variables or the value assigned to them. Other variables which influence 
Y but which are not under the control of the experimenter are called 
uncontrollable factors. 

The controllable factors are categorized as either 

(a) qualitative - if they assume only discrete values; 

(b) quantitative - if they assume a continuous range of values. 

An example of the first type are the factors in Case Study B (Alloy Selection) 
discussed in Chapter 3, Section 6.2, where the two factors are (i) the variations 
in ore quality (denoted by MA) and (ii) the method of manufacture (denoted 
by ME). Since there are four different ore types and two different methods 
of manufacture, the values assigned to the two factors are given by the set 
{MA/A and ME/ 1 < j^2}. Here the factors represent qualitative 

(categorical) features of the system under consideration. Examples of the 
second type can be found in most chemical, physical and engineering sciences. 
An example from chemical engineering is as follows. X 1 and X 2 represent 
the temperature and pressure at which some reaction is allowed to take place, 
and Y represents the percentage yield resulting from the reaction. Here, X t 
and X 2 can assume a continuous range of values and the experiment is to 
obtain information so that X , and X 2 can be selected optimally to maximize 
the percentage yield. We shall discuss this type of experiment in Section 6. 

2.1 Experimental Design for Qualitative Factors 

As mentioned earlier, in the case of qualitative factors, each controllable factor 
can only assume values from a finite set. The values are sometimes called levels, 
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indicating the origin of this chapter topic in agricultural sciences. In an 
experiment the experimenter can assign different values (or levels)* for each of 
the controllable factors. Let the levels that can be assigned to factor 
X,(l < i < N) be given by the set 

W, 1 <;<*,} 

Consider an experiment to establish whether the controllable factors X, 
(1 < i < N) have significant influence or not on F, which is as follows. Study the 
effect on Y by varying one factor, say X*, with all other factors (i.e. X„ 
I < i < N, i # k) held fixed. If Y does not change significantly, then we infer that 
X* does not have a significant effect on F. This approach is called the “single¬ 
factor approach" as we consider one factor at a time. This approach can lead to 
wrong inferences, as illustrated by the following example. 


Example I 

Consider a problem in which there are two factors, i.e. N = 2. Let Y tik be the observed value of Y 
with X x = X\ and X 2 = X{ at the kth replication. Suppose that the relationship is given by 

( 1 ) 

with (RC)ij = 0 for the following two cases 

Case 1: i = /*, j arbitrary 
Case 2: j = j*, i arbitrary. 

[RC)ij is non-zero for all other combinations, and V iJk is an independent Gaussian random 
variable with zero mean and variance a 2 . 

Suppose that the single-factor approach is used with X , set to X\* and X 2 varied. This will result 
in the inference that X 2 has no effect on Y. Continuing the experiment with X 2 set to X{* and X x 
varied w ill also result in a similar inference. The inference drawn from the experiment is obviously 
incorrect. 

The reason for the erroneous inference in this example is that the single¬ 
factor approach yields incomplete information about the system, since all 
combinations of factor levels are not studied. Caution must be exercised to 
avoid falling into such traps. 

2.2 Factorial Design 

In factorial design, Y is observed for all possible combinations of factor levels. 
Since X { has n, different levels, the total number of combinations, NC , is given 
by 

wc= n m, (2) 

i = 1 

Each combination is called a cell - a term that once again reflects the origin of 
this topic in agricultural science. 

'Ve shall henceforth use the term “lever’ instead of “value”, so as to be consistent with the usage in 
the design of experiment. 
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For the two-factor case (i.e. N = 2) let Y)j represent the observed value of Y 
with X, = X\ and X 2 = X{. Then, the observed values of a factorial design 
experiment can be represented as a 2 x 2 matrix, shown in Table 11.1. If the 
experiment involves repeated trials for each cell (i.e. a particular combination 
of factor levels) we employ a third subscript k. Thus, with this notation, 
represents the observed value of Y with X, = X\ and X 2 = Xi at the (cth 
repeat. If there is no replication, then the subscript k is omitted. 

The advantage of factorial design is that it allows for testing various types of 
interaction between factors. In the case of N factors the first-order interactions 
are of the form X,X ; (i.e. both factors appearing together); the second-order 
interaction are of the form X, XjX m , and so on. By an analysis of the observed 
values of Y one can draw inferences whether one or more of the controllable 
factors have a significant influence on Y or not. This is done by a statistical test 
procedure which tests a null hypothesis that a factor X f has no influence at all 
levels with an alternative hypothesis that it has influence at one or more levels. 
Analysis of variance is one such method for testing. 

2.3 Analysis of Variance 

We shall briefly outline the basis of the method and the construction of an 
ANalysis Of VAriance (ANOVA) table for carrying out the test. We shall 
confine our discussion to the two-factor case - i.e. N = 2, where the outcome of 
the experiment can be represented in tabular manner as indicated in 
Table 11.1. 

The underlying basis of the analysis of variance was briefly discussed in 
Section 4.1 of Chapter 7. In the parametric model formulation the effect of X, 
and X 2 on Y is modelled by an algebraic equation formulation. It could be of 
the form 

T,jk = A + R, + Cj + V iJk (3) 

or 

Y ijk = A + Rj + Cj + (RC) U + V tJk (4) 


Table 11.1 Observed Outcome for Two-factor Factorial Design 
Experiment 




Factor X 2 





Xi X§ • 

xi 

X?' 


x\ 

V.. r.2 


r.., 


X? 




Factor X x 

X', 



Y, 


XV 

n, 


n„. 


DESIGN OF EXPERIMENT 


199 


where the set of parameters {A; R h 1 ^ ^ n x ; C j% 1 ^ j ^ n 2 \ (RC) ijy \ ^i^n x 
and 1 ^ j^n 2 } are interpreted as follows 

A: the overall average; 

Ri’ the average deviation in A due to ith level of factor X x ; 

Cj\ the average deviation in A due to yth level of factor X 2 \ 

( RC)ij : the average deviation in A due to interaction between the ith level 
of factor X x and yth level of factor X 2 . 

Since R i% Cj and (FC) 0 are average deviations from the overall average, they 
are constrained by the following constraints: 


fli 


o 

II 

Qsf 

(5) 

"2 

Z = 0 

(6) 

j= 1 


Z (KC)y = 0 l^i «Sn, 

(7) 

;=i 


Z (KC), =0 1 ^j^n 2 

i = 1 

(8) 


Note that if the relationship is given by (3), and if F, = 0,1 ^ i ^ n, then X , has 
no influence on Y. Thus in this case the null hypothesis 

R x = R 2 = ••• = F„, =0 (9) 

is to be tested against an alternative hypothesis that one or more of the F, 
values are non-zero. Also note that in (3) we assume there is no interaction 
between X x and X 2> whilst in (4) we allow for such interaction through the 
parameter (FC) U . 

The term V ijk is called the residual term, and represents the effect of 
uncontrollable (also sometimes called unassignable) factors. We assume that 
V ijk is an independent Gaussian variable with zero mean and variance o 2 . As a 
result, Y ijk is also Gaussian; the sums of mean squares and the ratios of the sums 
of mean squares are distributed according to x z and F-distributions respec¬ 
tively. The degrees of freedom for these distributions are functions of n x , n 2 and 
the number of replications. Thus, testing of alternative hypotheses can be done 
using tables for F-distribution. 

2.4 Case Study B [Cont.] (Alloy Selection) 

In Section 4.2 of Chapter 7 we formulated a model to study the influence of 
ore from different mines and the method of manufacture on the thermal 
coefficient of titanium alloy. The model (given by equation (9) of Chapter 7) 
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Table 11.2 Observed Values of Thermal 
Coefficient of Expansion for Case Study B 


Method 


Mine ore 

ME X 

me 2 

MA X 

5.78 

5.46 


5.28 

5.79 

ma 2 

4.84 

5.73 


6.28 

4.36 

ma 3 

6.82 

5.76 


6.77 

4.31 

MA a 

5.57 

5.30 


6.44 

4.86 


is identical to (3) with Y iJk corresponding to TC ijk with factors X , and X 2 
representing mine ore and method of manufacture respectively. As a result, 
we have r?j =4 and n 2 = 2. The data available to Alpha Inc. are the test 
results of two specimens (or samples) for each combination of mine ore and 
the method of manufacture - i.e. k = 2. Thus, in this case Y ijk represents the 
thermal coefficient of expansion for the /cth (1 < k < 2) replication obtained 
from a combination of ore from mine i (1 ^ i^4) and method j (1 ^j^2). 
The test results are given in Table 11.2. 

To carry out the analysis we construct the ANOVA table. For the general 
case it is given in Table 11.3, where the standard convention is used, i.e. when a 
subscript is replaced by a “ it corresponds to taking an average over that 
factor. 


Table 11.3 ANOVA Table for Two-factor Case 


Source 

Degrees of freedom 

Sum of squares 

Between rows 

<"i -1) 

*" 2 1 

i* i 

Between columns 

("j- 1) 

K»/i(Y+-Yj 

1= 1 

Ml M 2 

Interactions 
(row x column) 

1 

N 

1 

KIP' 

1= 1 1 

M| Mj K 

Residuals 
(within cells) 

n,n 2 (K - 1) 

1 1 lov-V 

j = I ; - 1 k = 1 

Ml MJ K 

Total 

/),n 2 (X - 1) 

1 1 y 

i-U-lk-I 


6=(Y,-Y l -Y x -YJ 
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Table 11.4 ANOVA Table for Case Study B 


Source 

Sum of squares 

D.F. 

Mean squares 

Mine/ore ( MA ) 

0.7623 

3 

0.2541 

Method {ME) 

2.4103 

1 

2.4103 

Interaction (MA-ME) 

1.8171 

3 

0.6057 

Residuals 

3.6825 

8 

0.4603 

Total 

8.6722 

15 



Thus, 

Yij. = X Yijk/K (10) 

k * 1 

U = I i *V*«i (11) 

i = 1 k = 1 

and so on. 

The ANOVA table using the measured values of Y from Table 11.2 is shown 
in Table 11.4. To test for significance, each of the mean squares is tested against 
the mean square of residuals. In the case of the first factor (i.e. mine ore) the 
ratio of the mean square to the mean square of the residual (i.e. 0.2541/0.4603) 
equals 0.55. Since the ratio is less than one, we see that the factor is not 
significant. In other words, the variations from mine to mine are not 
significant. Similarly, in the case of the second factor (i.e. method of 
manufacture) the ratio of the mean square to the mean square of residuals (i.e. 
2.4103/0.4603) equals 5.24. This ratio is distributed according to a F- 
distribution with (1,8) degrees of freedom. Since F 18 . 095 = 5.32, we see that 
the effect of the method of manufacture is just below the 5 per cent level of 
significance. The interaction between the method of manufacture and the mine 
ore is also not significant as the ratio of the mean squares (i.e. 0.6057/0.4603) 
equals 1.32. 

Since there appears to be a difference in the thermal coefficient of expansion 
between the two methods of manufacture, we need to examine the means and 
estimates of their standard deviations. Using the test results from Table 11.2 
we have the following result: 


Af£, ME 2 
Mean 4.97 4.20 

and the estimated standard error is given by 0.24 (i.e. v /'0.4603/8). 

Asa consequence. Alpha Inc. must choose Method 2 and reject Method I. It 
does not matter which mine ore is selected. 
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2.5 Other Designs 

The factorial design with N variables, Xj..., X N , and Xj having n t levels 
(1 ^ i < N) requires a total of NC cells with NC given by (2). This is a large 
number if N and the various n, are large. If three or more factors are considered 
to be independent, and their interaction is of no interest, then the factorial 
design can be modified to require fewer cells and hence fewer observations of Y. 

Latin square and Graeco-Latin square are two of the several designs which 
require far fewer cells than the factorial design. 

3 Design of Experiments for Dynamic Systems 

(Causal Relationships) 

We confine our attention to stochastic dynamic systems. Let X(f) and y(f) be 
two variables, and the aim to find whether they are related by a causal 
relationship. More specifically, we want to test whether X(t) has any effect on 

y(r). 

In order to devise a test one needs a proper definition for causality in a 
stochastic framework. For the case where t assumes discrete values (i.e. a 
discrete time dynamic system) Granger [33] provided a definition for X to 
cause T in terms of the predictability of Y. 

Let {/4(f): f = 0, ± 1, ± 2,...} be the given information including at least 

{*(0, y(t)}. Let 

,4(r) = {/4(s)|s < f} (12) 

and 

4(r) = {/4(s)|ssSt} (13) 

and similarly define X(f), X(t), Y(t) and Y(t). Let P{Y(t)\B} denote the 
minimum mean square single-step predictor of L(f) given an information set B 
and let o 2 { Y(t)\B} be the resulting mean square error (m.s.e.). Then Granger’s 
definitions are as follows. 

(i) X(f) causes y(f), if 

o 2 { y(t)|/4(f)} < a 2 { y(t)M(t) - X(0} O 4 ) 

(ii) XU) causes Y{t) instantaneously if 

<r 2 {Y(t)\A(t\X(t)} <o 2 {Y(t)\A(t)}. (15) 

In other words, (i) implies that if X causes Y then knowledge of X(f) reduces the 
uncertainty in the prediction of Y(f)- Similarly, (ii) implies that knowledge of 
X(f) reduces the uncertainty in the prediction of Y{t) because of the 
instantaneous causal relationship. 

We can use the above definition for continuous time by using a predictor t 
time units ahead (t > 0) in place of single step in the above definition. 
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Various tests have been developed for empirical detection of causality in 
discrete time systems based on the above definition and used extensively in 
modelling social and economic systems. 

4 Design of Experiments for Static Systems 

(Parameter Estimation) 

In Section 4.3 of Chapter 7 we studied regression models. The structure of a 
general regression model is given by an algebraic equation of the form 

Y i = g(X i> 0)+V i (16) 

where X, is a known input, F, is an unknown random input and y, is a random 
output. In Chapter 10 we studied methods for estimating the parameter 0 
given {Cki, X,), 1 i ^ N } where y, is the observed value of Y t . The estimate 0 is 
a function of the observed data and since the y values are realized values of the 
random variable Y. the estimate is itself a random variable. The variance of 
the estimate depends on the method used and the input X„ l^i^N. This 
suggests then that the X, values can be selected optimally so as to minimize the 
variance of the estimate in some sense. The selection of X„ 1 / jg N to 

minimize the variance is called “optimal design”, and in this section we 
examine this topic. 

We commence with a model formulation in which 0 appears linearly in 
g{X,0); i.e. g{X,0) is of the form 

g(X,0) = h(X)0 (17) 

We introduce various criteria for optimal design. Following this, we study the 
optimal design for the general case. 

We initially assume that K f , i> 1, is a sequence of independent and 
identically distributed Gaussian random variables with zero mean and finite 
variance, and that 0 is a p-dimensional vector. We assume that X is a m- 
dimensional vector and Y can be either scalar or vector. 


4.1 Linear Model 

In this case the estimate 0 obtained by the method of least squares is given by 


H = 


(H T H)~ 

l {H T y) 

(18) 

~h(X ,) ” 



h(X 2 ) 


(19) 

_h(X N )_ 



lyl .y£] 

(20) 


where 
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( 21 ) 


E[0] = 0* 

and 

CO v[0] = a 2 (// T //)' 1 (22) 

Since [ K, j is an uncorrelatcd Gaussian sequence the estimate obtained by the 
method of maximum likelihood is also the same. In other words 0 given by 
(18) is the minimum variance estimator. Note that the covariance of 0 is a 
function of Xj, X 2 ,---*X N . 

4.2 Optimality Criteria 

As the covariance of 0 depends on X h 1 it suggests that a good 

experimental design is one that makes this matrix small in some sense. Since 
there is no unique ordering of the p x p matrix (H 1 H) \ various real-valued 
functions have been suggested as measures of “smallness for defining the 
optimality criteria. Some of the well-known ones are as follows. 

(i) D -Optimality - A design is said to be D-optimal if it minimizes 

D = \(H T H)~ l \ (23) 

i.e. D is the determinant of the covariance matrix. 

(ii) A ’■Optimality - A design is said to ^-optimal if it minimizes 

A = lr{(H T H)-'} (24) 

where tr is the trace operator. 

(iii) E-Optimality - A design is said to be £-optimal if it minimizes the 
maximal eigenvalue of (H r H)~ l . 

The minimization is done with respect to X it 1 and Xj being 

constrained to belonging to some region X, a subset of R m if X is a m- 
dimensional vector. Thus the optimal design is reduced to a constrained 
function optimization problem. In general, a computational iterative scheme 
would be needed to obtain the optimal design. 

Note that, for the linear model, the optimal X*, 1 ^ i ^ N, are independent 
of the parameters being estimated. 

4.3 General Model 

We assume that V h i ^ 1, is a sequence of independent and identically 
distributed random variables with a density function/(i;) and is not necessarily 
Gaussian. As a result the maximum likelihood estimate 0 is given by the value 
of 9 which maximizes 

L= X log/(y,- 0 (Xi, 0 )) 

i= 1 


(25) 
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and the variance of the estimate is given by 

var0 = A/ _l (26) 

with 

M = M(0*) = E{ [dL/d0Y[dL/d9 ]} \ 9m9 . (26') 

where the expectation is done over the observed data and 0* is the true value of 

0 . 

As in the earlier case, the optimal design, for estimating the parameter 0, 
requires choosing X h 1 < / ^ N, to minimize a function of M. Thus the 
procedure is similar to the linear model case except the matrix (H r H )' 1 is 
replaced by the matrix M(0*). Note that if the sequence V i% / ^ 1, is not an 
independent sequence, then the log-likelihood function L is more complicated. 

However, in contrast to the linear model case, the optimal Xf , 1 ^ ^ N, 
are a function of 0 * the unknown true values of the parameter. This is rather 
awkward as the experiment is to obtain information for estimating the 
unknown parameter, but the design requires knowledge of this unknown 
quantity. 

One way of overcoming this situation is to model the initial (or prior) 
knowledge of 0 * by a distribution function, and consider M given by 

M = E{M(9*)} (27) 

where the expectation is done over 0*. 


5 Design of Experiments for Dynamic Systems 

(Parameter Estimation) 

The philosophy for optimal design of experiments to estimate parameters of 
dynamic system models is similar to that for static system models. We shall 
confine our discussion to models involving either ordinary differential 
equation or difference equation formulations. If the uncertainty in the system 
is insignificant and the variables are measured accurately (i.e. the sensor noise 
is insignificant) then the estimation problem is relatively straightforward and 
the design of experiment not difficult. The design of experiment becomes 
significant, as well as interesting, when there is uncertainty either in the system 
and/or in the measuring device, so that the observed system output variables 
must be viewed as realizations of a stochastic process. 

In Section 4.6 of Chapter 10 we discussed the method of maximum 
likelihood for stochastic models. As indicated there, for large data lengths, 
the maximum likelihood estimate is the most efficient estimate - i.e. no other 
estimate has smaller variance. The covariance of the maximum likelihood 
estimate 9 is given by the Cramer-Rao inequality, i.e. 

cov (0) = Af “ 1 


(28) 
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where M is the Fischer information matrix given by 


M = E[{5Lf[y];[x];0)/a0} r {5L([y];[x];0)/^} ]!»=». (29) 


where Y is the system output, x is the system input and [T] and [x] are given 
by 


[y] = m 

[x] = x(t), O^t^Tj 


(30) 


in the case of continuous time model, and by 

[y] = y„ OsStsjNj 

[x] = x„ 0 s$isSNJ 


(31) 


in the case of discrete time model. The expectation is done over [T] and 9 * is 
the true value of 9. 

Thus, in this case, we see that the variance of the estimate 9 is a function of 
the input [x]. In optimal design of experiment the input [x] is chosen so as to 
make the variance of the estimate small in some sense. 


5.1 Input Constraints 

Often the input that can be used in experimentation must be constrained due 
to physical reasons. For example, if the absolute magnitude cannot exceed a 
level A, then we have the constraint 

|x'(t)l<y4* (32) 

in the case of continuous time system and 

|xjK/l, (33) 

in the case of discrete time system. We have assumed that the input x is a in¬ 
dimensional vector and x‘ represents the ith component of x. 

In the case of continuous time system {x T (r)x(t)} represents the input power 
at time f. Hence, if the experiment be done for a period 0<t^T, then the total 
energy used is given by 

j* {x T (r)x(r)}dr (34) 

Often the total energy available is limited, and this gets translated into the 
constraint 

j* {x r (t)x(r)}df sSa (35) 

For the case of discrete time systems we have the energy constraint given by 

£xjx,^<x < 36) 

f = 1 
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The optimal design of experiment must take into account such constraints if 
they are relevant. 

5.2 Optimality Criteria 

If 0 is a p-dimensional vector, the covariance of 0, given by (28), is a p x p 
matrix. We need to define a suitable criterion to compare different designs 
and for selecting the optimal. As in the case of static system, a variety of 
criteria have been proposed - the more popular ones being the /4-optimality 
and the D-optimality. The two are similar to that defined in Section 4.2, and 
hence will not be repeated here. 

We illustrate by a simple example involving a first-order differential 
equation model. 

Example 2 

Suppose that the system is adequately modelled by the formulation 

rdy(0/<ir= -y(f) + x(f) ( 37 ) 

with r being the unknown parameter with r* its true value. The input x(/) is restricted to being 
sinusoidal, i.e. 

x(f) = sin(o>r) (38) 

with oj being the free parameter which the designer must choose optimally. 

We assume that the output of the sensor Y 0 (t) is related to the system output ><f) by the relation 

YM = y(t) + V(t) (39) 

with V{t) being a Weiner process with zero mean and unit variance. Note that V(t) represents the 
sensor noise. The observed output is given by y 0 (t), 0 ^ t ^ T. 

Let the criterion for selecting the optimal co* be given by 

<«M)=-log|M| (40) 

where M is the average information per unit time, i.e. 

M = M/T (41) 

with M given by (29). Since there is only one parameter (i.e. p = 1), the Fischer information matrix 
is a scalar and is given by 

M = T2cu 2 /{ 1 + (<or*) 2 } 2 (42) 

(The details of the derivation have been omitted, and can be found in the reference cited in the 
Notes at the end of the chapter.) 

As a result, using (42) and (41) in (40), we have 

<HM) = - log [2w 2 /{ 1 + (<ut*) 2 }] (43) 

The optimal to* is obtained by solving the equation 

d<j){M)/d(D = 0 ( 44 ) 

which yields 

< 0 * = I/t* (45) 

Note that the optimal frequency is a function of the true value of the parameter which is unknown. 

Thus, as in the case of static system, the optimal design suffers from the 
drawback that it is a function of the true value of the parameter - a quantity 
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which is unknown. This is rather awkward as the goal of the experiment is to 
estimate this unknown. Various approaches have been proposed to overcome 
this drawback, and we discussed one such in Section 4.3. 

5.3 Optimal Sampling 

In continuous time systems the system variables change continuously with 
time. Often, measuring these variables continuously in time is not practical. In 
some instances (e.g. for processing the data on digital computer) one needs 
only sampled values of continuous variables. 

When sampling is employed, the sampling strategy becomes a part of the 
design of the experiment. We shall confine our attention to uniform sampling - 
i.e. the variables are sampled at time instants t = k A, k = 0,1,2,...; A is called 
the sampling interval which the experimenter must choose carefully. 

In general, sampling implies a certain loss of information from the signal 
being sampled. However, if the signal is band limited to [ — /,/], then a 
uniform sampling at a rate f s ^ 2/ will result in no loss of information as the 
original signal can be reconstructed from the sampled values. This result is 
useful in the context of designing optimal input and sampling schemes for 
estimating parameters of linear dynamic models. 

If the system is adequately modelled by a deterministic linear time-invariant 
ordinary differential equation, then the output >-(/) resulting from sinusoidal 
input x(t) (given by (38)) is of the form 

y(t) = 4(cu) sin (cur + 4>(co)) (46) 

i.e. the output is also a sinusoidal signal of the same frequency but differing in 
both amplitude and phase. Thus, using a sampling scheme with sampling rate 
/, = (o/n (i.e./, = 2/) will imply no loss in information. 

The data available for estimation are the sampled values contaminated 
with sensor noise. The optimal design is obtained in the manner described 
earlier - i.e. optimizing a suitable optimality criterion involving the Fischer 
information matrix obtained from the likelihood function of the sampled 
values of the variables. We illustrate by considering the model of Example 2. 


Example 3 

The model is as given in Example 2, and the parameter to be estimated is t. We impose the 
constraint that x(r) is of the form given by (38), as a result of which y(l) is of the form given by (46). 
We employ a sampling rate 

/, = 2/ = <o/n < 47> 


The average information per unit observation is given by 

M = 2ita>/{ 1 + ,48) 

(The details of the derivation have been omitted, and can be found in the reference cited in the 
Notes at the end of the chapter.) Using the criterion given by (40), wc have the optimal w* given by 


<u* = l/lv^r*) 


(49) 
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and the optimal sampling rate 

ft = (o*/n = XH^/lnx*) (50) 

or 

A* = yflnx* (51) 

Note that again the optimal design is a function of the true value of the parameter which is 
unknown. 


. 5.4 Case Study H [Cont.] (Thermal Power Station) 

In Section 6.8 of Chapter 3 we briefly discussed the system characterization of 
a thermal power station for designing regulators. The function of the 
regulators is to change the operating condition of the boiler, turbine and 
generator as the electrical load changes. 

In order to design proper regulators one needs to model the relationships 
which link changes in input to changes in output for each of the three 
subsystems viz. boiler, turbine and generator. In this section we consider 
only the boiler, and confine our attention to the case where the changes are 
small so that the relationships can be adequately modelled by ordinary linear 
differential equations. The model is as shown in Figure 11.1, with the input 
variables being the changes in the firing rate and the valve setting and the 
output variables being the changes in the pressure and the steam flow. Note 
that the approach taken in building this model is the “black-box" approach, 
and each of the four boxes represents linear dynamic relations. The aim of the 
experiment is to obtain the parameters of the linear dynamic relationships. By 
using an approach along the lines of Examples 2 or 3 (depending on whether 



Figure 11.1 Model of boiler for Case Study H. 
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the output is measured continuously or is sampled), the input to yield optimal 
estimates can be found. 

6 Response Surface Design 

In its simplest form, response surface methodology uses simple mathematical 
models invoving low-degree polynomial formulations to approximate the 
relationship between the output variable (called response variable) and a set of 
input variables over a range of interest. Since the input variables can assume a 
continuous range of values, they can be viewed as quantitative factors affecting 
the response variable. 

The response variable represents a measure of the system performance - e.g. 
yield or efficiency. One of the aims of the response surface design is to improve 
the performance of the system by systematically varying the input variables 
using the measured values of the response variable. This use of the response 
surface methodology is known as evolutionary operation (EVOP). Various 
design schemes have been formulated and studied in this context. 

7 Other Aspects 

The discussion of the design of experiment so far has been done in a very 
narrow framework. The discussion has ignored features such as time and cost 
factors and quality of information. In this section we discuss the design of 
experiments which take these features into account. 

7.1 Time and Cost Factors 

As mentioned in Chapter 10, the variance of parameter estimate is a function 
of the data length used in the estimation. The larger the data length, the smaller 
the variance of the estimate, i.e. the variance is inversely related to data length. 
However, larger data length implies experimentation over a longer period of 
time. Since experimentation involves human and material resources, the cost 
of an experiment increases with the duration of the experiment. Thus a suitable 
trade-off must be made between the variance of the estimate and the cost of 
experimentation. This is achieved by defining a suitable criterion which 
incorporates the cost aspect as well as the quality of the estimate. A fairly 
general form for such a criterion is given by 

J = 0 (c,T,N) + tf>(M) (52) 

where (f>(M) is a measure of the quality of the estimate and g is a measure of the 
cost of experimentation. The parameters c, T, N and M have the following 
meanings: 

c = cost of experimentation per unit time, 
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T = total time period for experiment, 

N = number of points sampled if a sampling scheme be used, and 
M = Fischer information matrix. 

The function g is an increasing function of its arguments and the function 0 
decreases as the variance of the estimate decreases. Thus, in this case, the 
optimal experiment is the one which yields a minimum value for J. The trade¬ 
off is highly problem-dependent, as illustrated by the following two examples. 

Example 4 (Introducing New Brand) 

Suppose that a manufacturer of detergents is planning to introduce a new brand so as to increase 
his market share. A Markov chain formulation can be used to model the change in the market 
share with the release of the new brand. The entries of the one-step transition matrix would 
characterize the probability of switching between brands. The economic viability of introducing 
the new brand would depend crucially on the parameters of the one-step transition matrix. 

To obtain data for estimating the parameter, the manufacturer might decide to do some field 
trials. In this case he has to decide on what population size he should have in his field trial, and the 
number of weeks that he should carry out the trial. The larger the value for both these variables, 
the better is the quality of the data for estimating parameters. But this implies greater cost of 
experimentation. Thus, in this case, the manufacturer must decide on a suitable trade-off between 
cost of experimentation and quality of data obtained. 


Example 5 (Oil Drilling) 

Oil drilling is a costly and risky venture. Geological survey yields some data but is not very 
reliable. A more refined experimentation is the use of seismic exploration. This is more expensive 
but gives better-quality information about the presence of oil. Drilling sample holes is still more 
expensive but yields definite results regarding the presence of oil. Thus, in this case, the oil explorer 
must decide on how to spend his total exploration budget optimally so as to maximize the 
probability of finding oil, given that he can select different types of experimentation and that he has 
a choice of more than one site. This is a good example of the trade-off between cost and quality of 
information in experimentation. 

We end this chapter by considering the design of an experiment for Case 
Study I. 


7.2 Case Study I [Cont.] ( Component Reliability) 

In Section 4.5 of Chapter 10 we examined the estimation of the parameters of 
the distribution function for the time to failure. There we assumed that the 
failure data were given. When no failure data are available, the data must be 
obtained by experimentation. In this section we propose a variety of schemes 
for experimentation. 


Scheme 1 

Select N components and put them into operation on a test bench which 
simulates the working environment for the component. The experiment is 
stopped after a pre-specified time period T. Let K represent the number of 
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elements that fail during the experiment. This is a random variable assuming 
integer values in the interval [0, N]. The data available for estimation are 
given by x f , 1 ^ i^ K, the ages at failure for the K failed components and 
the additional information that (N - K) components have survived for a 
period of T 


Scheme 2 

Select N components and put them into operation on a test bench as in 
Scheme 1. The experiment is stopped at the instant when the nth failure occurs, 
where n is a pre-specified number. Thus, in this case, the time interval for the 
experimentation is a random variable. The data available for estimation are 
given by x„ 1 ^ ^ n, the ages at failure for the n failed components and the 
additional information that the remaining (N - n) components have survived 
till r given by 

t = max {x h 1 ^ i ^ n} (53) 

In both these schemes the experimenter has to select /V, the number of 
components to put on the test bench. In Scheme 1 the duration of experiment 
is fixed, whilst in Scheme 2 it is a random variable. The cost of experimentation 
under Scheme 1 is 

</[£(*)]+ cT (54) 

and under Scheme 2 it is 

dn + cEl t] (55) 

where d is the cost associated with the failure of a component and c is the cost 
of conducting the experiment per unit time. These costs have to be traded 
against the quality of the estimate obtained using the data from the 
experiment. 


Scheme 3 (Accelerated Testing) 

If the mean time to failure is large, then under Scheme 1 we might have K = 0 if 
T is small, and under Scheme 2 r becoming very large. One way of reducing the 
time for experimentation is to subject the component to a hostile environment 
which will hasten (or accelerate) the time to failure. This is achieved by 
changing the environmental conditions - e.g. the electrical and/or mechanical 
stresses applied, the temperature of the environment and so on. This approach 
is called “accelerated testing”. 

The technique is justified only when a relationship exists between the 
parameters of the failure distributions under normal and accelerated con¬ 
ditions. Various types of relationships have been advocated. The “power-law 
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relationship is one such, and is given by 

0 = 0 o (V/V o ) p (56) 

where 9 is the model parameter under a stress level V and 9 0 is the value under 
the normal stress level V 0 . Thus, if V > V 0 and p > 1 then 9 > 9 0 . If the mean 
time to failure is inversely related to 9 then increasing V implies a shorter time 
period for completing the test. 


Notes and References 

Design of experiments for static models have been studied for a long time and a 
large number of books are available on the topic. Items [l]-[8] are a small 
sample. In addition, there are many books on specific designs (e.g. [9]) or 
design of experiments for specific model formulations (e.g. [10]). The case 
where some data are missing is discussed in [11]. For more on optimal design, 
see [12]—[14]. 

In contrast to static models, the design of experiments for dynamic models 
has received attention only over the past 15 years. For more on design of 
experiments for models involving difference or ordinary differential equations, 
see [ 15]—[ 18]. Most of the new results can only be found either in journals or 
conference proceedings - e.g. [19] deals with the design of experiments for 
distributed systems (i.e. systems involving partial differential equation 
formulations). 

The detailed derivation of the results of Examples 2 and 3 can be found in 
[18]. 

Application of experimental designs for either understanding or parameter 
estimation in different disciplines has also received a lot of attention as seen 
through the large number of books that have appeared; [20]-[29] give an 
illustrative sample. 

For more on response surface methodology, see [30]-[32]. 
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CHAPTER 12 


Validation 


1 Introduction 

In Chapter 4 we defined a mathematical model as a symbolic representation 
involving an abstract mathematical formulation obtained by relating the 
mathematical formulation to a system characterization on a one-to-one basis. 
A mathematical model is called an adequate model if it is adequate for the 
purpose (or goal) in the mind of the modeller. In Chapter 1 we indicated the 
motivation for building mathematical models as being the desire to solve real- 
world problems. Thus a mathematical model is an adequate mathematical 
model if it captures the salient features of the system associated with the 
problem, and is capable of yielding a meaningful solution to the original 
problem. 

Validation is the process of establishing the adequacy of a given mathemat¬ 
ical model. It is the last step in the modelling process and, in some ways, the 
most difficult step to execute. It involves the definition of a test (or procedure) 
to decide whether a given model is adequate or not. If the model passes the test, 
then the modelling activity comes to an end and the model can be used with 
confidence to obtain solutions to the original problem. On the other hand, if 
the model fails the test, then one has to back-track and start a new iteration in 
the modelling process. 

In this chapter we focus our attention on validation. More specifically we 
concentrate on whether the model is in agreement with the system it is 
modelling. We commence with a brief discussion on the fundamental difficulty 
in validation; this is done in Section 2. Following this, we discuss the different 
approaches to validating models in a fairly general framework in Section 3. In 
Section 4 we discuss tests for validating deterministic models and in Section 5 
we consider the same for stochastic models. In Section 6 we consider the 
comparison and ranking of alternate adequate models. 

2 The Fundamental Difficulty 

As indicated earlier, validation involves defining a suitable test to decide 
whether the model is in agreement with the system in some sense determined 
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by the modeller. Thus, the crucial feature is the test used in validation. If the 
test is too stringent, then we have the following: 

(i) The modelling process will require a large number of iterations to 
complete the exercise. (This is based on the assumption that one starts with 
the simplest mathematical formulation and system characterization and 
progressively increases the complexity with each iteration.) 

(ii) The final model which passes the test will be too complex, and hence of 
limited use in solving the original problem. 

On the other hand, if the test is not stringent enough, then we have the 
following: 

(i) Practically any model will pass the test. 

(ii) The final model can be such a poor representation that it is practically 
useless for solving the original problem. 

This is the fundamental difficulty which makes validation the most difficult 
step in the modelling process. Whatever the test, the degree of stringency must 
achieve a suitable trade-off between the complexity of the model and the 
agreement between the model and the system being modelled. In modelling, 
one starts with a degree of stringency based on the model builder's intuition 
and past experience. It may well need altering during the modelling process so 
as to achieve a suitable trade-off of the kind discussed above. We shall discuss 
the degree of stringency further when we consider specific tests in later 
sections. 

3 Approaches to Validation 

The two distinctly different approaches to validation are as follows: 

Approach /: based on model assumptions, 

Approach 2 : based on model behaviour. 

We can combine elements of the above two approaches and define a third 
approach: 

Approach 3: based on model assumptions and behaviour. 

We shall briefly discuss each of the above three approaches and illustrate 
through examples. 

3.1 Approach 1 (Model Assumptions) 

In this approach the inadequacy of a model is established by a critical 
examination of the basic assumptions made in building the model. The 
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justification for the approach is that if the assumptions are invalid then the 
agreement between the model behaviour and the system behaviour will be 
poor, and hence and the model will not be adequate to solve the original 
problem. Thus, rejecting a model based on this approach saves the effort 
needed in the analysis, estimation, experimentation, etc., for the model. 

The rejection of a model based on this approach is highly problem- 
dependent, as illustrated by the following example. 


Example I 

A model is to be built to describe the motion of a simple pendulum. If the time interval of study is 
sufficiently small, so that the energy loss due to frictionai drag is very small, then the assumption 
that the frictional drag is negligible is valid. However, if the time interval of study is large, then a 
model built on the assumption that the frictional drag is negligible is not valid, since the effect of 
drag on the pendulum motion is cumulative. Hence, in this case the model should be rejected. 

In the above example, the decision to reject a model as not being adequate was 
based on the time interval of study and not on the actual system data i.e. the 
data for pendulum motion. In some cases one either requires system data or 
some analysis to establish the validity of the assumptions of a model, as 
illustrated by the following examples. 


Example 2 ( Weather Changes: Case Study G) 

For Case Study G (Weather Changes) we formulated three models (Models G. I to G.3) in Section 
5.3 of Chapter 7. Model G. 1 assumes a Markov property for the changes in the weather. To decide 
whether this assumption is valid or not will require system data and a suitable statistical test. 


Example 3 (Rocket Launch: Case Study D) 

For Case Study D (Rocket Launch) we formulated a model (Model D.l) in Section 5.4 of 
Chapter 5. The assumption that the thrust is an impulse is valid if the thrust lasts for a very small 
fraction of the total flight time; something that is not known initially. Only after a first solution can 
this be checked. This again emphasizes the iterative nature of modelling. 


It is important to note that valid assumptions by themselves are not 
sufficient to ensure the adequacy of a model. Hence, as indicated earlier, this 
approach is useful in establishing the “inadequacy”, rather than the “ade¬ 
quacy”, of a given model. In addition, this approach is of limited use in the case 
of models where the underlying assumptions are not stated explicitly, as with 
“black-box” modelling. 


3.2 Approach 2 ( Model Behaviour) 

In this approach the validity of a model is established by a test which evaluates 
the agreement between model and system behaviour. The assumptions made in 
model building are not important. What matters is how close the output of the 
model is to the output of the system for equivalently controlled input. The 
closeness is determined by the stringency of the test. The test involves the error 
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between the model and the system outputs in the case of deterministic models, 
and the statistical properties of the model and the system outputs in the case of 
stochastic models. 

Since this approach involves system data, it is desirable that the data used 
for validation be different from those used for estimating the model 
parameters, to ensure that validation is independent of parameter estimation. 
Typically, one should divide the available system data into two parts, with the 
first part (comprising the initial two-thirds of the total data) to be used for 
parameter estimation and the second part (comprising the final one-third of 
the total data) for validating the model. This is possible when the data 
available are sufficiently large. When the data are limited, the same data must 
be used for both estimation and validation. 

In Sections 4 and 5 we discuss tests for validating deterministic models and 
stochastic models respectively. 

3.3 Approach 3 (Model Assumption/Model Behaviour) 

In this approach the validity of a model is established by: 

(i) checking the validity of the assumption on which the model is built, and 

(ii) the closeness of the agreement between the behaviour of the model and the 
system. 

In other words, this approach is a combination of the two earlier approaches; 
it thus contains the good features of both. 

4 Validation of Deterministic Models 

We shall first consider models with one independent variable, i.e. models 
involving either ordinary differential equation or difference equation formul¬ 
ations. Let y*(t) and y m (t) represent the system output and the model output 
respectively for the same input u(t). The independent variable t represents time, 
and assumes a continuous range of values in continuous time modelling and a 
discrete set of values in discrete time modelling. The actual error between the 
two outputs, e(t\ is given by 

e(o=/(o-y"(o o) 

and the relative error by 

e(t) = e(t)//(t) (2) 

4.1 A Test for Validation 

The closeness between model and system behaviour can be defined in terms of 
an index J involving either the actual error e(t) or the relative error e(t) over the 
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time interval for which system data are available for validation. The index J 
characterizes the goodness-of-fit. Without loss of generality, we assume that 
the data available for validation are given by 

/(f), O^t^T (3) 

in the case of continuous time model and by 

/(',), O^i^N (4) 

in the case of discrete time model. 

A simple form for J is given by 

7 = j e T (t)e[i)dt (5) 

or 

J = jV(f)e(f)df (6) 

for continuous time model, and by 

J = t (7) 

i = 1 

or 

J = t e T iU)e«i) (8) 

i= 1 

in the case of discrete time model. Other forms for J can also be used. 

We accept the model as an adequate representation of the system if J is less 
than some pre-specified value 6. If J is bigger than d the model is regarded as 
being an inadequate representation, and hence rejected. 

The value assigned to S determines the stringency of the test. As <5 gets 
smaller, the test becomes more stringent. The value one assigns to S is 
subjective, and must be based on a suitable trade-off between the goodness-of- 
fit and the complexity of the model. 

Suppose that S is small and that J is given by (5)-(8). Can one be sure that the 
test will always ensure that the model behaviour is in good agreement with the 
system behaviour? The answer is NO as illustrated by the following example. 


Example 4 

Let fit) be the system output and >7(0 and >7(0 be the outputs of two different models (M, and 
M 2 respectively), as shown in Figure 12.1. Then using the above test with 6 fairly small (^ 10" 2 ) we 
have model .V/, pass the test whilst model M 2 fails the test. However, one should prefer M 2 to M, 
as M 2 models the significant feature (i.e. large rise and fall) whilst M, does not 

The above example suggests that the simple form for J given by (5)—(8) is not 
the most appropriate for establishing the adequacy of a model. In other words, 
we need to use more complex forms for J . 
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Figure 12.1 Plot of system and model responses for Example 4. 


4.2 Characterization of Goodness-of-fit 

The formulation of a suitable index J to characterize the goodness-of-fit is a 
non-trivial task. It must ensure that the model behaviour matches the system 
behaviour and reflects the salient feature of the system behaviour. The salient 
features could be, for example 

(i) the number of turning points in the system output; 

(ii) the maxima (or minima) of the system output and the time instants at 
which they occur. 

Thus a more suitable index for Example 4 would be 

J = y , fj MO} 2 df + y 2 1 /(f 1 ,) - y M (f 7)1 + V s I *7 - 41 (9) 

where tj[f7] is the time instant at which /(f) [/"(f)] reaches maximum; y h 
1 < i < 3, are non-negative constants reflecting the relative weights attached to 
the various deviations between model and system responses. 

4.3 Two or More Independent Variables 

In this case both the model and the system responses are functions of the 
independent variables (x,y,z) representing spatial coordinates, and t repres¬ 
enting time. The goodness-of-fit index, J, must take into account the error 
between the two responses over the range of the independent variables. In the 
case of two independent variables, with the ranges given by a^x^b and 
0 ^ f ^ T, two simple forms for J are as follows: 

J = f dx f (e(x, f)} 2 df 00) 

Jo Jo 
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with e(x, f), the error between the responses given by 

e(x, f) = /(x,f)-/"(x,0 


or 



{e(Xt,t)} 2 dt 


with a < x { < x 2 <....< x N ^ b, and the x f , 1 ^ j ^ N specified. 


( 11 ) 

( 12 ) 


4.4 Effect of Sensor Noise 

So far we have assumed implicitly that the system output is measured 
accurately. In other words, there is no error in the sensor used for measuring 
the system output variable. Often the measured value of the system differs from 
the true value due to the influence of sensor noise. In this case the measured 
output /(f) is related to the true output y*(r) by 

m=m+v(t) d3) 

with V(t) representing the sensor noise. As a result the test needs to be modified 
to take into account the influence of V(t). Note that with sensor noise, the 
index J (given by (5)-(8)) is greater than zero even if /"(f) = /(r); i.e. the model 
output is identical to the true system output. 

If V(t) is a scaled Weiner process with a scaling factor a, /(f) and y m (t) are 
scalar and J is given by (5), then 

E[J] = a 2 T+J e 2 (t)dt (14) 

where e(t) is given by (1). 

This suggests the following modification to the test given earlier. Accept the 
model if J < a 2 T + S and reject the model if J > a 2 T + 6 with <5 > 0. As with the 
earlier test, d reflects the stringency of the test. However, this is not a good test 
for establishing the adequacy of a model. If T is small, then the difference 
between J and its expected value has high variability and hence the test can 
result in wrong inferences. On the other hand, if T is large, then a 2 T tends to 
dominate d and the effect of S is lost. 

An alternative approach is to evaluate the difference between the measured 
system output and the model output; i.e. /(f) — y m (f), for discrete values f„ 
1 ^ i ^ /, in the interval [0, T}. If y m (t) is fairly close to /(f), then /(f,) - y m (f t ), 
l^i^/, is a sequence of random variables with approximately the same 
statistical properties as the sequence K(f,), 1 ^ / < /. Since K(f,), 1 < i < / is 
a sequence of independent and identically distributed Gaussian random 
variables with zero mean and finite variance, the adequacy of the model can 
be established by a suitable statistical test involving the statistical property 
of the sequence /(f,) — y m (f,), 1 ^ ^ / and comparing it with the corres- 
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ponding statistical property of the sequence K(r f ), 1 ^ i ^ /. We shall discuss 
such tests in the next section. 


4.5 Case Study A [Cont.] (Weight Lifting] 

In Section 3.2 of Chapter 5 we proposed three models (Models A.l to A.3) 
which relate the weight lifted (L) to the body weight of the lifter (W). We 
considered estimating model parameters in Section 3.5 of Chapter 10 using 
the data given in Table 2.2. 

As the data length is short, we used all the data in parameter estimation and 
hence there are no independent data for validation. As such, we are forced to 
use the same data for the purpose of validation. 

We use J given by 

J=i UM^,) - LJWM/LAW,)} 2 (15) 

I = 1 

as the index of the goodness-of-fit. L S (W / I ) is the actual weight lifted by a lifter 
with body weight and L m (VF,) is the corresponding value predicted by the 
model. We accept the model as being adequate if J ^ d with d = 10 -3 and 
reject it as being inadequate if J > d. 

Using the model parameters estimated in Section 3.5 of Chapter 10, the 
values for J for the two models are as follows. 

Model A.l J = 1.035 x 10’ 3 
Model A.2 0.406 x 10' 3 

Model A.l is not an adequate model as J > d, whilst A.2 is. If 6 is reduced to 
10" 4 , then we see that neither of them is an adequate model. 


5 Validation of Stochastic Models 

In stochastic system modelling, the data available for validation \ ^i^N 
are a realization (or outcome) of the system output variable TJ, I < / < N. Let 
Y7\ 1 ^ ^ m, represent the variables of the model which correspond to Y*. If 
the model be simulated (as indicated in Chapter 9) we have a set of model 
outcomes yj", 1 < i ^ N which can be viewed as a realization of TJ". 

One might be tempted to use the test of the previous section (i.e. based on the 
goodness-of-fit between yj and yj", 1 ^ i < N) to validate the model. Unfortu¬ 
nately, this is not meaningful as the realized value of the model (i.e. yT* 
1 ^ i ^ N) changes if the simulation is repeated. Thus, one needs a different 
approach for validating stochastic models. In this section we discuss two 
different approaches. These are statistical comparison of responses and 
hypothesis testing. 
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5.1 Statistical Comparison of Responses 

Let Q s represent some statistical characterization of the system response. It 
could be, for example, the probability density function or the moments of the 
system output. Note that Q s is deterministic. Let Q m represent the statistical 
characterization for the corresponding model response. 

Since both Q s and Q m are deterministic, we can use the error between the two 
as an indicator of the mismatch between the system and the model, with the 
mismatch being viewed in a statistical sense. 

A Simple Test for Establishing Model Adequacy 

A simple test is as follows: define a goodness-of-fit index J as a functional of the 
error between Q m and Q s in a manner similar to that in the previous section. If 
Qs[Qm \' s l ^e stationary probability density function p 5 (y) [p m (y)] of the system 
[model] outputs Y s (t) [FJO] and the outputs are scalar, then a suitable J is 
given by 


J = j {p*(y)-Pm(y)¥<iy 06 ) 

WQslQm] are the first J moments 1 ^ j ^ J, then a suitable J is given 

by 

} 2 (17) 

j = 1 

As in the previous section, we accept the model as being an adequate model if 
J ^ 6 where S is a specified positive quantity. If J > 6 then the model is rejected 
as being inadequate. The value of S determines the stringency of the test. 

For a given model, Q m can be obtained by either analytical or comput¬ 
ational methods. Unfortunately Q s cannot be obtained in a similar manner. 
However, an estimate (5 S of Q s can be obtained using the system data available 
for validation. Thus, the testing for adequacy is to be done using Q s instead of 
Q s . The drawback of this test is that if the data available for estimation are 
limited then Q s is a poor estimate of Q s and hence the test can lead to wrong 
inference. 

5.2 Hypothesis Testing 

In hypothesis testing we use the observed system data to test a null hypothesis 
against an alternative hypothesis. In the context of model validation the null 
hypothesis would be that the observed system data have come from the model 
under test (in other words, the model is an adequate representation of the 
system), and the alternative hypothesis would be that it has come from some 
other model. 
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The decision to accept or reject the null hypothesis is done by a test which 
divides the space of observations into two regions -A and B - the null 
hypothesis is accepted as being true if the observed data falls in region A and 
rejected otherwise. As a consequence, one can make two types of error: 

Type I error: Rejecting the null hypothesis when it is actually true. 

Type II error: Accepting the null hypothesis when it is false. 

The probabilities of making these two errors depend on how the space of 
observations is divided. It is impossible to make both these probabilities 
arbitrarily small. In general, tests are designed (i.e. the division of the space of 
observations into two regions) such that the probability of Type I error does 
not exceed some pre-specified value a. This value is called the significance level 
of the test. The probability of Type II error be given by /?; (1 - (I) is called the 
power function of the text. A test in which a is fixed and (1 — P) is maximized is 
called the uniformly most powerful test. 

Various methods have been proposed for the division of the space of the 
observations. Depending on the model formulation, the decision to accept or 
reject can be reduced to testing a statistic involving the null hypothesis and the 
observed data. We shall illustrate this point by considering three specific 
model formulations. We omit the details of derivation, and interested readers 
should consult the reference cited at the end of the chapter. 

5.3 Validating Markov Chain Model 

Let the model be given by a Markov chain formulation with r states and the 
parameter of the one-step transition matrix be given by 

Pij = P?r 1 (18) 

In this case the null hypothesis is that the state of the system evolves with the 
one-step transition matrix given by (18). 

Using the system data available for validation, we obtain estimates P l7 , and 
compute a statistic J given by 

j=i i*i(pij-p?j)/p?j (i9) 

.= i;= i 

where n, is the number of times the process was in state i over the interval 
of the data used in the validation. Asymptotically J is distributed according 
to a x 2 distribution with (r(r — 1 ) — d) degrees of freedom, where d is the 
number of which are zero and the summation in (19) is done only over 
(ij) for which P° > 0. 

From the tables of x 2 distribution we evaluate Pr{* 2 ^ J} with J given by 
(8). We accept the model as an adequate representation if 

Pr {x 2 >3} <a 


( 20 ) 
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for a given level of significance of a. If (20) is not satisfied, we reject the model 
as being adequate to represent the system. 

5.4 Validating Deterministic Model with Sensor Noise 

We considered the effect of sensor noise in Section 4.4. Using the notation of 
that section, we have the error 

40 = yo(0-y m (0 (21) 

to be approximately equal to V(t) if the model is an adequate model - i.e. the 
model response is very close to the system response. 

If V[t) is a sealed Weiner process, and the model adequate, then eO,), 

1 is a sequence of independent and identically distributed random 

variables with zero mean and variance a 2 . In this case a model can be 
validated by testing the null hypothesis that «?(/,.), I ^ ^ «, is a sequence of 
independent and identically distributed random variables. If o 2 is known, 
then the test would involve a / 2 distribution. If a 2 is not known, then the 
test would be based on a Student distribution. As in the previous example, 
the acceptance or rejection of the null hypothesis will depend on the signi¬ 
ficance level. 

If one does not make any assumptions about the distribution of |/(r,) then 
non-parametric tests can be used to decide whether the null hypothesis is 
true or not. 

5.5 Validating ARM A Model 

Let the data available for parameter estimation and validation be the system 
output given by >*, Ui^/V + M. We use the initial data - viz. 1 ^ ^ N 
for parameter estimation and the remainder viz. N+\^i^N + M for 
validation. 

Assume that the model is given by an ARMA formulation 

y? = Z W’T- i + z P,0), + U), (22) 

1=1 i= 1 

where the order n is specified, and the parameters estimated by the method of 
maximum likelihood - viz. by minimizing where <b, is given by 

n n 

(b t = — £ ^i^i- j X (23) 

i= 1 i=l 

The underlying assumption of ARMA formulation is that cu„ t^O is an 
independent and identically distributed Gaussian sequence with zero mean 
and variance a 2 . 

We see from (22) and (23) that if the model is an adequate representation 
then (cu,) must be a sequence with statistical properties identical to that for the 
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{a>,} sequence. Thus a test for establishing adequacy can be devised by 
checking for the independence of the J<u,{ sequence. In other words, the null 
hypothesis is that the sequence {u>,} * s an independent Gaussian sequence with 
zero mean and variance a 1 . 

Towards this end, let R k be the kih sample autocorrelation of u>, obtained by 

K* = { 'l +!)]}/[ jt ^/[N-(«+!)]} (24) 

1 ^ k < K and K « N. If the model is an adequate representation, then R k is 
approximately Gaussian in distribution with zero mean and variance propor¬ 
tional to 1/N, for 1 «$ K. Also, are statistically independent. 

Thus, a simple test for model validation is as follows: if R k , 1 < k K , lie in 
an interval - S/yJN to + 6/jN, <5 > 0, then the null hypothesis is true and the 
model an adequate representation. As 6 gets smaller the test becomes more 
stringent. 

An alternative test which requires K > n is as follows: compute Q by the 
relation 

Q=t R k (25) 

it = i 

If the model is adequate, then Q is distributed according to a y 2 distribution 
with (K — n ) degrees of freedom. Using tables for y 2 distribution, one can 
decide whether the null hypothesis is true or not, for a given significance level. 

Note that the tests described so far validate the model in terms of the 
underlying assumptions (i.e. in the spirit of Approach 1). We combine this 
with a test involving system and model response - i.e. comparing yj - yf 
for N + 1 < f < N + M, then the validation is in the spirit of Approach 3. 

5.6 Case Study J [Cont.] (Speed of Aeroplanes) 

In Section 4.2 of Chapter 10 we discussed the problem of estimating the 
parameters of Models J.l and J.2(b) for predicting the speed of aeroplanes in 
the year 2000. The estimation was done using the historical data (from 
Table 2.6) for the period 1900-1940. 1 n this section we discuss the validation of 
these two models using the remaining part of the data - i.e. the data covering 
the period 1941-1965. This period contains 21 data points and hence we use 
the following criterion for the validation purpose. 

Define <p as 

<t>=£ {|xj-x-"(0|/xf)/21 (26) 

l = 1 

where t h 1 < i ^ 21, are the time instants at which the speed changed due to 
technological improvements. 

We accept the model as being adequate (for the purpose of forecasting) if 
< p ^ 0.05 and reject the model \f (p> 0.05. 
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Model J.l 

Using the parameter values of Section 4.2 of Chapter 10, the forecasts are 
given by 

x m (t) = 47.07 + 11.35(t - 1900) (27) 

A plot of the forecasts, along with the historical data is given in Figure 12.2. It 
is clear that </> > 0.05 and hence the model must be rejected. 

Model J.2(b) 

For this model the forecasts are given by 

x m (r) = exp {3.75 -f 0.053(r - 1900)} (28) 

and plots of model output and historical data are given in Figure 12.3. In this 
case 4> is < 0.05 and hence we can accept the model as being adequate for 
forecasting purposes. The forecast value for the year 2000 is 16,000 mph. 


Comment. Since the model assumes an exponential growth, it is doubtful if 
the model can be used with credibility for long-term forecasting. For short¬ 
term forecasting the credibility is not in dispute. 

Model J.3 

We discussed this model briefly in Section 4.4 of Chapter 7. This model views 
changes in maximum speed as being due to improvements within a technology 
and due to changes in the technology. The changes in the technology are 
modelled by a scale factor of 5 and 15, as discussed in that section. As a 
result, the model for forecasting is given by (27) for 1900 ^ t ^ 1940 by 

x m (t) = A + 56.75(f - 1940) (29) 

for 1940 ^ t ^ 1980, where A is the forecast value from (27) for r = 1940, and by 

x m (0 = B + 170.25(f - 1980) (30) 

where B is the forecast value from (29) for t = 1980. 

The forecasts from this model, along with the historical data, are shown in 
Figure 12.2, and in this case again we have <f> < 0.05, implying that the model is 
adequate for the purposes of forecasting. The forecast value for the year 2000 is 
given by 8700 mph. 

Comment. This model is superior to Model J.2(b) as it characterizes the 
effect of change in technology more explicitly, as opposed to merely assuming 
an exponential growth. 

We could have used alternative tests involving the residuals (i.e. difference 
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t(years) 

Figure 12.2 Forecasts, Models J.l and J.3. 
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between the data and the model predictions) since for an adequate model these 
would be a sequence of independent and identically distributed random 
variables. The test employed in this section is more in the spirit of Approach 2, 
whilst a test based on residuals would be in the spirit of Approach 1. 

5.7 Case Study G [Cont.] (Weather Changes) 

The model described in Section 5.3 of Chapter 7 for weather changes involved 
parameter estimation using the data given in Table 2.4. In this section we 
assume that the one-step transition matrix is given by the matrix 

(31) 

Thus, the null hypothesis is that the one-step transition matrix P is given by 

P = P° (32) 

We want to test this against the alternative hypothesis that 

P*P° (33) 

We shall assume that the data available for testing this hypothesis are the data 
given in Table 2.4. (Note we have estimated P° using some other data.) 

We follow the procedure outlined in Section 5.3 and evaluate the statistics J 
given by (19). The P u and n, are the same as given in Section 4.8 of Chapter 10. 
Using these values and that given in (31) we have 

J = 3.3369 (34) 

If P° is the true one-step transition matrix, then J is distributed according to a 
y 2 distribution with 5 degrees of freedom. This follows as P° has one zero entry. 
From the tables of y 2 distribution with 5 degrees of freedom we should accept 
the model if J < 1.145 and reject the model if J ^ 1.145 for 5 per cent level of 
significance. Since J > 1.145, we must reject the null hypothesis and view the 
model as being inadequate. 


P° = 


0.55 0.30 0.20 
0.60 0.00 0.40 
0.50 0.30 0.20 


6 Selection Among Alternative Models 

In general, there will be more than one model that will pass the validation test. 
This is to be expected, for in modelling one takes a pluralistic view of the 
system being modelled; hence there will be more than one adequate model. 
Thus, given two or more different adequate models the question that arises is 
the following: Is one of them better than the rest in some sense? In this section 
we briefly discuss two factors that can be used to rank different models (for a 
given system) to indicate the best. 
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6.1 Principle of Parsimony 

In this, a model M { is preferred to a model M 2 if M, has fewer parameters. 
Thus models can be ranked in terms of the number of parameters in the model. 
This type of ranking is meaningful, for the cost and effort involved in 
parameter estimation and design of experiment increase with the number of 
parameters. 

6.2 Parameter Sensitivity 

If a model response is highly sensitive to the parameters of the model, then the 
model is of limited use for prediction purposes, as small errors in parameters 
will result in large errors in the model response. Thus, models can be ranked in 
terms of the sensitivity of the response to changes in parameter values. This 
requires defining a suitable sensitivity index. One such is the following: 

m = r WWyMWW* 6 *} * 35 ) 

where 59 is the change in 0 and Sy is the change in model output due to change 
in the parameter. (It is assumed that both 0 and y(t) are vectors.) The above 
index is one of many that can be used. Also, the ranking based on one index 
might not agree with the ranking with a different one. 

Notes and References 

Validation is in some sense the most difficult step to execute in the modelling 
process. In general, books on mathematical modelling contain very little 
discussion on the topic. 

Most of the work done on model validation is scattered in various journals: 
[1]-[11] give a small sample; [12] and [13] examine model validation for a 
class of model formulations. 

There are a large number of books on parametric as well as non-parametric 
methods for testing (e.g. [ 14]—[ 16]) which discuss various statistical tests of 
use in modelling. For more on hypothesis testing, see [17]. 

Further details of the derivation of the results of Section 3.6, see [18]. 
Validation methods for time series modelling can be found in a large number 
of books on time series modelling (e.g. [19]). 

For further discussion on the role of parsimony, see [20]; on sensitivity, see 
[21] and [22]; and on complexity, see [23]; [24] and [25] discuss validation 
of models in economics. 
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CHAPTER 13 


Pitfalls in Modelling 


“A pitfall is a conceptual error into which, because of its specious plausibility, people 
frequently and easily fair (Majone, 1980). 


1 Introduction 

Since mathematical modelling is a multi-stage activity requiring a variety of 
concepts and techniques, there are many pitfalls which, if not avoided, can lead 
the unwary modeller to produce absurd models and hence absurd solutions to 
the original problem which motivated the modelling exercise. In this chapter 
we focus our attention on these pitfalls in mathematical modellling. 

2 Pitfalls 

Some of the main pitfalls which the model builder should be aware of are listed 
below. 

(1) Forgetting that modelling is not an end in itself but is only a means 
towards solving real problems. 

(2) Failure to understand the problem properly and as result producing 
models of limited value. 

(3) Forgetting that a model is only a simplification of the real world 
associated with the problem, and that the two are not the same. 

(4) System characterization being not appropriate for the problem requiring 
solution. 

(5) Confusing correlation with causation and vice-versa in system 
characterization. 

(6) Using a mathematical formulation which is not well posed in a 
mathematical sense. 

(7) Failure to check the validity of simplification and/or the correctness of the 
use of mathematical techniques. 

(8) Failure to recognize the ill-conditioned nature of the formulation in the 
case of solving ill-conditioned formulations on computer. 

(9) Using inconsistent estimators for parameter estimation. 
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(10) Poor or incorrect design of experiment which yields little new or relevant 
information. 

(11) Failure to interpret properly the results of mathematical analysis in 
physical terms of the problem. 

(12) Poor selection of criterion for validation. 

(13) Failure to critically evaluate the limitations of the model and the 
credibility that one should attach to the solutions obtained. 

(14) Forcing the problem to fit a particular model rather than choosing the 
most appropriate model for the problem. 

(15) Failure to achieve sensible compromise between conflicting factors. 

(16) Forgetting that the aim is to produce the simplest adequate mathematical 
model. 

(17) Failure to recognize one's own limitation. For complex problems, one 
needs people with different backgrounds to build an adequate model. 

(18) Failure to differentiate “the tree from the woods". This can lead to 
focusing the attention on specific aspects and ignoring other important 

ones. 

(19) Failure to document the model properly so that the assumptions and 
limitations are obscured. 

3. Final Advice 

The best advice for intended mathematical model builders is summed up in the 

following analogy due to Golomb [2]. 


A rocky road filled with pot-holes and a large sign in the foreground warning - PROCEED 
WITH CAUTION. 


With this word of caution we come to the end of Part I. 
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CHAPTER 14 


Dynamics of Malaria 
Spread 


1 Introduction 

Malaria is a tropical infectious disease which menaces more people in the 
world than any other disease. The World Health Organization reported in 
1978 that, in Africa alone, one million children die annually from malaria 
before they reach the age of 5. It is conservatively estimated that malaria 
victims number around 100-200 million annually. Various schemes have been 
advocated for control of malaria but success has been limited. For the control 
strategies to be more efficient, improved epidemiological knowledge is 
required, including a better understanding of the population dynamics leading 
to the spread of malaria. In this context mathematical models play two 
important roles. Firstly, they help in clarifying the interaction between 
different variables of importance and lead to practical insights into the 
mechanisms underlying the observed phenomena. Secondly, they are useful in 
making a quantitative evaluation of the impact of different control strategies, 
so helping in the selection of optimal strategies for controlling the disease. In 
this chapter we focus our attention on building models to understand the 
process - in other words, to develop a mathematical theory for the spread of 
malaria. We commence with a brief discussion of the biology of malaria in 
Section 2. Section 3 studies the system characterization aspect. Section 4 looks 
at mathematical modelling in a general context and discusses the problems 
associated with data for parameter estimation and model validation. The 
remainder of the chapter deals with a variety of mathematical models. Each 
section discusses a specific model starting with the building and finishing with 
critical evaluation. We end with a concluding section which discusses further 
models that can be built. 

2 Biology of Malaria 

Malaria is an infectious parasitic disease in the sense that the disease is 
transmitted at some stage in the life cycle of a relevant organism (also called 
parasite) from an infected host to an uninfected host (also called susceptible). 
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The transmission of malaria is relatively easy to describe. The parasites 
causing malaria in humans are from the species of protozoa in the genus 
Plasmodium , and they are transmitted to humans by female mosquitoes 
belonging to certain species of the genus Anopheles . The female mosquitoes 
pick up the infection when they bite infected humans. Thus, the parasite has 
two hosts - humans and mosquitoes. 

The Plasmodium life cycle can be described as follows. The parasite evolves 
through different stages during its life cycle. We start with the sporozoite 
stage - the stage where they reside in the salivary glands of infected female 
mosquitoes. When an uninfected human subject is bitten by an infected female 
mosquito, Plasmodium cells in the sporozoite stage are injected into the human 
blood. They are carried to the liver where they complete their growth and 
multiply asexually to produce merozoites . These then pass out of the liver into 
the blood stream and enter the red blood cells where further asexual 
reproduction takes place. In the process the red blood cells are destroyed, thus 
releasing more merozoites to attack healthy red blood cells. This cycle is 
repeated until the concentration of parasites in the blood reaches a high level, 
at which time the symptoms of malaria may appear in the host. The symptoms 
are chill, shivering, various aches and pains, etc. The incubation period is the 
time interval between the first bite and the appearance of clinical symptoms. 

After a certain time the asexual reproduction of some merozoites comes to 
an end and distinct male and female gametocytes are formed in the red blood 
cells. When an uninfected female mosquito bites an infected human with 
gametocytes in the red blood cells, the gametocytes enter the new host. Once in 
the mosquito's stomach the gametocytes give rise to male and female gametes. 
The fertilization of the female gametes produce zygotes which go through a 
process of development resulting in a large number of sporozoites which 
migrate to the salivary gland. At this stage the infected female mosquito is 
capable of starting the Plasmodium life cycle all over again. 

Thus, the parasite responsible for malaria in humans spends a part of its 
cycle in the human body and the rest in the mosquito. Humans are only an 
intermediate host harbouring the asexual phase of development while the 
mosquito is the final host as fertilization of eggs needed for sexual reproduc¬ 
tion takes place there. It is interesting to note that the mosquitoes do not 
appear to be affected by the parasite in the manner in which humans are. 

There are variations of detail between different species of Plasmodium , but 
the above description is adequate for our purpose - namely, building 
mathematical models to describe the population dynamics of malaria spread. 

3 System Characterization 

In Chapter 3 we defined system characterization as a simplified and idealized 
description of the real world relevant to the problem of interest, and in 
Chapter 4 we saw the role of it in the model building process. Since our 
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problem in this chapter is to understand the population dynamics of malaria, 
the system characterization involves describing variables relevant to the 
spread of malaria and the relationship between the variables. We have briefly 
discussed the role of Plasmodium as the parasite, and humans and mosquitoes 
as hosts in the spread of disease, in the previous section. In this section we 
examine other variables of interest and the framework for characterization. 

3.1 Variables and Interactions 

The variables of relevance are not too many. The variables can be described in 
different degrees of detail. We start with the simplest characterization and 
move on to a more detailed one. 

(i) Human Population 

In the simplest description one would divide the population into two groups — 
infected and uninfected. The uninfected are the group containing the 
susceptibles. One can characterize each group in terms of actual numbers, or 
as percentages of the total population. A slightly more detailed description is 
to divide the population into three groups - infected, susceptible and immune. 
We shall discuss immunity later on. A fairly detailed characterization is as 
follows. The total population is first divided into two categories — immune and 
non-immune. The non-immune category is divided into four groups - 
negative, incubating, positive infective and positive non-infective; the immune 
category is divided into three groups - negative, incubating and non-infective. 
In the above description an individual is called positive only if the parasite is in 
the blood, incubating if the parasite is in the liver, and negative if there are no 
parasites in the body. Individuals move from one group to another as they 



Figure 14.1 Grouping of human population into seven groups. 
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become infected or recover from an attack. A flow diagram for the last 
description is as shown in Figure 14.1. Note that the characterization ignores 
reduction in number due to deaths (either due to malaria or from other causes) 
but includes new births as they join the negative non-immunc group. 

The description becomes more detailed if we include factors such as age, 
quality of life, health standards, etc. The last two factors affect the transition 
from one group to another described above. The influence of age can be 
accounted for either by dividing the population into different age groups or by 
characterizing the population (total number or percentage) as a continuous 
function of age. 


(ii) Mosquito Population 

As with the human population, the simplest description would divide the total 
population into two groups infected and uninfected. A more refined 
description is to divide the infected group into two subgroups - infective and 
incubating. Where there is more than one species of malaria-carrying 
mosquito, we can either treat each species separately or ignore the variations 
between species. 


(iii) Parasite 

We briefly discussed a simple characterization of the life cycle of Plasmodium in 
the previous section. Malaria in humans is caused by four different species 
belonging to the genus Plasmodium. The species are: P. falciparum, P. malariae , 
P. vivax and P. ovale. In the simplest characterization one would ignore the 
difference between the species, whereas a more detailed characterization 
would differentiate between them. 


(iv) Climatic and Geographical Factors 

Climatic and geographical factors play an important role in the life cycle of 
mosquitoes. Pools of water are essential for mosquito breeding. Where this 
element is missing (e.g. deserts) the incidence of malaria is very low. 
Temperature is also an important factor. The sexual cycle of the parasite 
(characterized by the incubation period taking place in the mosquito from the 
acquisition of gametocytes to the production of sporozoites) depends both on 
the species of Plasmodium as well as on the temperature. As a consequence, 
malaria is confined to regions between 60°N and 40 S latitudes, and to places 
with altitude less than 3000 metres. Various parameters of relevance to the 
spread of malaria vary from region to region and must be estimated for each 
region. 
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(v) Mosquito/Human Interaction 

As indicated in the previous section, the onset of malaria in humans occurs at a 
certain stage in the life cycle of Plasmodium , and the life cycle requires 
interaction between mosquito and humans (in the form of the mosquito biting 
the human) with at least one of them being infected. The interaction can take 
place only when there is adequate contact. Adequate contact occurs by chance 
and hence must be characterized probabilistically. A parameter of high 
significance and interest is the probability of an adequate contact. If this is 
large (i.e. adequate contact occurs with high certainty) then the disease will 
spread to all susceptibles very quickly and we have an epidemic. If the 
probability is small, then the spread is relatively slow. The parameter 
characterizing the probability of an adequate contact depends on many 
factors resistance of susceptible to infection; infectiveness of the infection; 
virulence of the parasite; quantity of parasite transmitted, etc. 

In the simplest characterization we consider only the first adequate contact 
which infects the human, and treat the parasite in the human host as belonging 
to a single brood. In real life, multiple adequate contacts often occur over time, 
resulting in more than one brood of parasites being present in the one human 
host. This is called superinfection. A more detailed characterization would take 
this into account. In this case the infected human population must be divided 
into different groups based on the number of broods in the blood stream of 
individuals. 

(vi) Immunity 

Immunity is of special importance as it acts as a barrier to uninfected humans 
getting infected when bitten by an infected mosquito. Immunity can arise in 
many ways - e.g. built up over time due to previous exposures; stimulated 
artificially by vaccination or immunization; genetic factors and so on. 
Accounting for immunity is difficult because of this wide range of possible 
sources. 

3.2 Deterministic vs Stochastic Characterization 

The spread of malaria requires the transfer of parasites from infected hosts to 
uninfected susceptible hosts. From the discussion so far, the two factors which 
are important are: (i) adequate contact resulting in bites which transfer 
parasites, and (ii) conditions to allow the life cycle of the parasite and mosquito 
to operate. The latter is influenced by changes in the season and can be treated 
as being deterministic. In contrast, the former is highly variable and, as 
discussed in Section 3.1, needs to be characterized probabilistically. As a 
result, when the human population is characterized by different groupings (as 
discussed in Section 3.l(i)) the transition between certain groups must be 
treated in a stochastic manner. However, if the human population is large 
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then, as an approximation, we can view the transition as being more or less 
deterministic and the rates of transmission between different groups given by 
the average values of the stochastic case. This approximation implies that the 
process can be modelled in a deterministic framework using mathematical 
formulations discussed in Chapter 5. 

3.3 Time Scales 

One can identify various time scales with regard to the biology and 
epidemiology of malaria. A few of the more important ones are listed below: 

(i) Time for sporozoite to enter the liver after being deposited into human 
blood by mosquito bite — 30 minutes. 

(ii) Incubation period - time between receiving the infection and the ap¬ 
pearance of clinical symptoms — 10-30 days depending on the Plas¬ 
modium species. 

(iii) Cycles of attack (cyclical pattern of symptom occurrence, with symptoms 
being chill, shivering etc.) ~~ 1-3 days depending on the type of malaria. 

(iv) Extrinsic incubation period (see Section 3.2(iv)) depends on the species of 
parasite as well as the external temperature. For the species P. falciparum 
we have the following figures. 

~ 10-11 days when external temperature is 25°C, 

^ 15-17 days when external temperature is 23°C, 

— 22 days when external temperature is 20°C. 

Below 19°C the development of parasite is indefinitely retarded while 
above 32°C the proportion of parasites surviving drops off rapidly. 

(v) Life cycle of mosquito — 14 days. 

(vi) Life cycle of human — 70-80 years. 

(vii) Duration of immunity ~ ^0. 

Let T be the time interval of study - i.e. we are interested in the population 
dynamics of malaria spread over this period. For the study to be meaningful, T 
should be — 1 -10 years. If the changes to the human population are small over 
the time interval of study, we can ignore the variations and treat the total 
population as being constant. The number of mosquitoes can vary from 
season to season, but over a season we can approximate it as being roughly 
constant. Such assumptions simplify the system characterization and hence 
allow the use of simpler mathematical formulations (as dummies) in the 
modelling process. 

3.4 Discrete vs Continuous Time Characterization 

We can treat the time variable as being either continuous or discrete when 
building mathematical models to help our understanding of the spread of 
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malaria. If time is treated discretely, the interval of discretization most 
appropriate depends on the degree of detail incorporated into the model. 
Thus, if the incubation periods are being modelled explicitly, then an interval 
of discretization corresponding to a day is adequate. On the other hand, if 
the variables of the model are only numbers belonging to different groups, 
then larger intervals (e.g. week, month) would be appropriate. For the models 
to be presented in the next section, we will model time as a continuous 
variable. 

4 Mathematical Modelling 

A variety of models have been built to model the spread of malaria through the 
population. The earliest model is due to Ross, and was proposed in 1916. It is 
impossible to discuss all the models that have appeared since then. Here we 
shall discuss a series of six different models chosen to highlight the iterative 
nature of modelling where each subsequent model is an improvement over the 
earlier. The modelling of the spread of malaria is difficult, for the following 
reasons. 

(i) The mechanism for the spread of malaria is very complicated; hence great 
care needs to be taken to ensure a proper trade-off between the reality and 
the complexity of the model. 

(ii) The model parameters depend on geophysical location. Hence a model 
with a specified parameter set value for a particular region might not be 
valid for another region. 

(iii) The data available for parameter estimation, as well as for validation, are 
often limited and inadequate. In some instances the parameters can only 
be estimated based on a small section of the population - e.g. a cohort of 
newborn babies. In other instances, parameters must be obtained by 
specific field or laboratory experiments. We shall discuss some of these 
aspects when we consider specific model formulations. 

(iv) The problem of validation is difficult and requires combining forecasting 
abilities of the model with biological and epidemiological considerations. 

Sections 5-9 deal with five deterministic model formulations which can be 
viewed as a progressive sequence of improved models. In Section 10 we 
examine a stochastic model. 

5 Model I (Ross, 1916) 

This model was first formulated by Ross in 1916 [5]. The spread of malaria is 
characterized as a deterministic process and the approach taken is the “black¬ 
box" approach as the system is characterized through a single variable 
representing the proportion of the human population affected by malaria. 
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5.1 Model Formulation 

Let x(f) represent the proportion of human population affected by malaria at 
time t. As a result, the proportion unaffected is 1 — x(f). Changes to x(f) over 
time occur due to unaffected people getting infected, and affected people 
recovering. Note that the model does not differentiate between infectious and 
non-infectious in the group of affected people, nor between susceptibles and 
immunes in the unaffected group. Let r represent the recovery rate (i.e. 
proportion of affected people who recover in a unit time) and h represent the 
proportion of unaffected people receiving infectious bites per unit time. It is 
assumed that these two parameters do not change with time. 

Thus, changes to x(f) in a small interval Ai are given by 

x(f 4- At) — x(r) = {/i(l — x(f)) — rx(f)}Af (1) 

which in the limit At-»0 yields 

dx(f)/df = h - (r + h)x(t) (2) 

i.e. the model involves a simple linear first-order ordinary differential equation 
formulation. 

If the proportion of infected at time r = 0 be given by 

x(0) = x o (3) 

then the model is completely specified by (2) and (3). 

The parameter set is given by 

e = {r,h} (4) 

with the constraint 

© = {r^O-.h^O}. 


5.2 Model Analysis 

It is easily seen that x(f),r ^ 0 is given by 

x(r) = {Mr + h )}-{[Mr + *)] - x 0 } e- ,r+M * (5) 

As f-»oo, we have the asymptotic value x(oo), which is given by 

x(oo) = Mr + h) (6) 

and this corresponds to the proportion of infected population under what is 
called the endemic condition. 

Some special cases are as follows: 

(i) r = 0,h >0->x(oo) = 1; i.e. the whole population is infected. 

(ii) r>0,h = 0-»x(oo) = 0; i.e. the infection disappears 

(iii) r = 0,h = 0-*x(ao) = x 0 ; i.e. there is no change. 
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If we concentrate on newborn children, then x 0 = 0, which yields 

x(f)={Mfc + r)}{l— e- ,r+ *»'} (7) 

( Note : In the above case, x(t) represents the proportion of newborn population 
affected by malaria at time r.) Let p(t) represent the population of infected 
people detected at time t using some testing procedure. If the testing procedure 
is perfect, then p(t) = x(t). However, if an infected person is detected with 
probability k( 0 < k < 1) then we have 

pit) = MO = {kh/(h + r)} {1 - e' (r+ *"} (8) 

The above equation will play an important role in parameter estimation, with 
k being treated as an additional parameter. 

5.3 Parameter Estimation 

By observing the spread of malaria in a cohort of newborn children, we can use 
(8) to obtain estimates for the parameters of the model. Let n, represent the 
number of children observed, and a, the number found to be infected at time f„ 
1 < i < /• Then the estimates can be obtained by minimizing 

Z {p(0)-(«,/«,)} 2 (9) 

i = 1 

where p(f,) is given by (8). 

An alternative statistical approach is to view a, as being the number of 
children infected in a population n, at t, when the probability of being infected 
is given by p,. As a result, the likelihood of observing a„ given 

n„ 1 i /, is given by 

(io) 

where K ( is given by 

£( = {<*(! («(-«()!}/«(! 

and is independent of the parameters to be estimated. Note that the unknown 
parameters enter through p, = p(t,), with p(r ( ) given by (8). The log-likelihood 
function is given by 

/ 

L=k+ y {a, logp, + (n, - at) log( 1 — p,)} (11) 

where K is a constant independent of the parameters to be estimated. The 
estimates are obtained by maximizing L with respect to the parameters. 

Bailey (1982) [3] obtained parameter estimates using data from Pull and 
Grab (1974) [11] but aggregated monthly. The aggregated data is given in 
Table 14.1, and the value oft, used for each interval is the central point of the 
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Table 14.1 Aggregated Monthly Data for Observed Proportion of Infected [3] 


Age 

interval 

Central 

point 

If 

N umber 
inspected 
ni 

Number found 
positive 
flf 

Observed proportion 
of infected 

Pi = flf/Wf 

1 

45 

202 

26 

0.129 

2 

75 

235 

76 

32.3 

3 

105 

210 

91 

43.3 

4 

135 

210 

93 

44.3 

5 

165 

243 

140 

57.6 

6 

195 

219 

121 

55.3 

7 

225 

219 

128 

58.4 

8 

255 

241 

135 

56.0 

9 

285 

179 

121 

60.8 

10 

315 

204 

118 

57.8 

11 

345 

205 

132 

64.4 


interval. Note that for the data used 7=11. The estimates obtained by Bailey 
are as shown below. Figure 14.2 shows the fit between the model output and 
the observed data indicating a reasonably good fit. 

h = 0.0089 ± 0.00059 ] 

/> = 0.00050 ±0.00148 (12) 

k = 0.637 ± 0.078 ] 

Since r is very small, we can treat it as being zero for young children and 
regard h and k as the two unknown parameters. The estimates obtained by 
Bailey (1982) [3] are given by 

h = 0.0081 ±0.00054) 
k = 0.662 ± 0.024 j 



t (months) 

Figure 14.2 Comparison of Model I response and observed data (O)- 


5.4 Validation 
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One way of validating the model is through the goodness-of-fit between model 
prediction and the observed data. This would require a statistical test and 
Bailey [3, p. 150] uses the x 2 test to validate the above model. 

Another way to evaluate the model is in terms of its qualitative behaviour. In 
the above model, as well as in the many cases studied by Macdonald 
([4], [6]) using a cohort of newborn babies, the value for r obtained is very 
low. Since the reciprocal of r corresponds to the duration for which a person 
stays infected, once bitten by an infectious mosquito, r = 0.0005 corresponds 
to an infection lasting 6 years. This is in strong disagreement with the actual 
average period of infection. Hence the validity of the model with r very small 
is doubtful. 

It is worth noting that the small values obtained for r involved field data for 
newborn babies; hence one might infer that r is small only in the early stages of 
life, and that it increases with age. If this be the case, then the above model must 
be rejected, as it assumes the recovery rate to be a constant independent of age. 

Finally, the model is a gross over-simplification of the process for the spread 
of malaria, for it ignores many features e.g. mosquito population, incubation 
period, uncertainty in the spread, to name a few. 


6 Model II (Mcdonald, 1950) 

This model was first proposed by Macdonald [6] with the mathematical 
development done in conjunction with Irwin; hence it is sometimes called the 
“Macdonald-Irwin model”. This model was built to overcome the deficiency 
of Model I, which yielded very low values for r when fitted to real data. The 
model is built using the concept of superinfection, which assumes that the 
human host can carry multiple broods of parasite and is in infected state as 
long as there is at least one brood present. Unfortunately, the precise notion of 
multiple broods is not explicit in the model formulation. 

6.1 Model Formulation 

We use the notation of the earlier model. The model is given by 

dx(f )/df = h- rx{t) h ^ r ) 

= fc(l-x(t)) h>r\ 

An intuitive justification is as follows. If A < r, then in time At all individuals, 
whether infected or not, exhibit new infection at a rate hAt. Thus, the change in 
x(r) over At is given by 


{h-rx(t)}At 


(15) 
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If h > r, then once infected the individual would never recover; hence change in 
x(t) over At is given by 

h(\ -x(t)) (16) 

In the limit Af -► 0, we have (14) as a consequence of (15) and (16). The model is 
completely specified by specifying the proportion of population infected at 
time t = 0, i.e. by x 0 given by (3). 

6.2 Model Analysis 

It is easily seen that x(t) satisfying (14) is given by 

x(t) = (h/r){ \ — e“ r '}, /i^rl 
= 1 -e“* h>r\ 

As t -► oo the asymptotic value is given by 

x(oo) = (/i/r), h^r) 

= 1 , h>r] 

For a small value of r,x(f) obtained from this model 
obtained from Model I (equation (7)). 

6.3 Parameter Estimation 

Using various bodies of field data, Macdonald [6] obtained estimates for r 
closer to the realistic values of r ~ 0.005. Thus the model is an improvement 
over Model I from this point of view. 

6.4 Validation 

As mentioned earlier, the model is an improvement in the sense that it yields 
more realistic values for the parameter r. Unfortunately the model does not 
truly incorporate the original concept of multiple broods of parasite in the 
body as formulated by Macdonald. In addition, the model suffers from the 
over-simplification discussed in Model I. 

7 Model III (Dietz, 1970) 

This model was first developed by Dietz [7] to represent adequately the 
concept of superinfection formulated by Macdonald, and is called the 
“Macdonald Dietz model”. The characterization is done through the total 
population being divided into different groups based on the number of broods 
present in an individual. Thus, individuals belonging to group j carry j broods 
of parasite in their bodies. Ifj=0 then the group is the collection of individuals 
unaffected by malaria. As in the earlier models, the characterization of the 
mechanism for the spread is done in a deterministic framework. 


(17) 

( 18 ) 

is very close to that 
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7.1 Model Formulation 

Let fj(t\0 < oo, represent the proportion of the total population with 
exactly j broods. If x(f) is the proportion affected by malaria at f, we have 

x(t)= t fj(t) (19) 

;= i 

and the unaffected proportion given by 

l-*(') = /o(f)=l- tfAO (20) 

j=i 

In an interval At, a change in fj(t) can occur when one or more persons in 
group (j — 1) is bitten by an infectious mosquito resulting in an increase in the 
number of broods by one in the bitten individual, or when one or more persons 
in group (j + 1) recovers partially so that the number of broods in the infected 
person is reduced by one. 

Let r define the recovery rate for individual brood and h be the rate of new 
infections being introduced. We have the increase in fj(t) in time At given by 

{Vi-iW + ry+D/j+tWjAr 

and the decrease in fj(t) given by 

Uh + rj)fj(t)} 

Using a limiting argument we have the model given by the following set of 
equations: 

d/ o (0/dr = - hf 0 (t) + rf , (f) 

dfj(t)/dt = hfj _, (r) - (h + rj)fj(t) + r(j + 1 )f J+ , (f), j > 1 
The model is completely specified once the distribution at t = 0 is specified; i.e. 

/,(0)=/,9(0^00) ) (24) 

(with fj ^ 0 and £ /? = 1) 

J m 0 J 

is given; x(f), the proportion infected, is given by (19). 

7.2 Model Analysis 

We shall obtain the solution for the case applicable for a cohort of newborn 
babies. In this case, (24) is given by 

f°o= 1; /? = 0, (25) 

Define 

F = F(io, t) = £ co j fj{t) 
i=o 




( 26 ) 
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Then multiplying (23) by a) j and summing over all j we have 


dF/dt = h(o) — 1 )F — r[a> — 1 )dF/d a) (27) 

and from (24) and (25) we have 

F(a>, 0) = 1 (28) 

Equation (27) is a first-order linear partial different equation and can be solved 
to yield 

F(co, f) = exp {— {h/r)(o) — 1)(1 -e'")} 

From (19) we have 

= 1 -exp{-(/i/r)(l-e- r ')}. 

As r-MX) we have the asymptotic result 

x(oo)= 1 — exp( — h/r) 

which yields the proportion affected under endemic conditions. 

Let p(t) represent the proportion of infected people detected at time t using 
some testing procedure. If the procedure is perfect, then p(t) = x(f). If the testing 
procedure is not perfect, then the proportion of observed infected people at 
time is given by p{t) = /cx(f), with 0 < k ^ 1. 

7.3 Parameter Estimation 

The procedure for estimating the parameters (viz. A, r and k) is essentially 
similar to that discussed in Section 5.3. Using the cohort of newborn babies 
with a h n { and f, as defined in Section 5.3 we have the log-likelihood function 
given by (12) with p, given by 

Pi = kl 1 -exp{-(/i/r)[l -exp(-rfi)]}] (32) 

For the data given in Table 14.1, the estimates for the parameters are nearly 
the same as that given in (12). This follows, as for small values of r, p ( obtained 
from (32) is nearly the same as that obtained from (8). 

7.4 Validation 

The comments of Section 5.4 are equally applicable here, so we shall not repeat 
them. However, the model is an improvement as it incorporates the concept of 
superinfection. 

8 Some Comments 

In the previous three sections we discussed three deterministic models for the 
spread of malaria. In this section we comment on the models studied so far, so 


(29) 

(30) 

(31) 
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as to give a better insight and provide a basis for some new model 
formulations. 

(i) The models are over-simplified. 

(ii) The models discussed so far do not characterize the mosquito population 
and the interaction between humans and mosquitoes. 

(iii) The models are deterministic, whereas the interaction and spread are 
highly stochastic. 

(iv) The parameters r and h were obtained using data for observed propor¬ 
tions of infected in the cohort of newborn babies. These parameters need 
to be related to more basic parameters of the process for the spread of 
malaria. 

In this section we discuss (iv). Improved models which overcome the 
limitations discussed in (i)-(iii) are studied in subsequent sections. 

In addition to x, r and A, defined earlier, we employ the following symbols. 
Our presentation follows along the lines of Bailey [3]. 

m: mosquito density per human 

a: average number of humans bitten per day per mosquito 
s' proportion of mosquitoes with sporozoites in their salivary gland 
b: proportion of mosquitoes, having sporozoites in their salivary glands, 
that are actually infectious 
n: extrinsic incubation period 

p: probability that a given mosquito survives over a given day 
y: mosquito death rate. 

From general biological consideration we have 

h = masb (33) 

We assume that, once infected, the mosquito stays infected for the remainder of 
its life. A mosquito gets infected only when an adequate contact is made with 
an infected person. Because of the extrinsic incubation period there are no 
infected mosquitoes with age < n . Thus, if we assume that adequate contacts 
between a mosquito of age t > n and infected humans is given by a Poisson 
process with parameter xa(t - n), then the probability of at least one adequate 
contact is given by 

1 — exp( — xa{t — n)) (34) 

Thus the proportion of infected mosquitoes of age r to r + dr is given by 

{1 - exp (- xa(t — n))}ye“ y, df (35) 

and by integrating (35) from n to oo we obtain the overall proportion of 
infected mosquitoes which corresponds to s. In other words, we have 
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s = axe yn /(ax + y) (36) 

Since the mosquito death rate is y, the probability that a mosquito survives one 
day is given by 

P = e" y (37) 

Substituting (37) in (36) we obtain 

s = axp n /(ax - log p) (38) 

and from (33) we have 

h = (ma 2 bxp n )/(ax — log p) (39) 

This shows explicitly the dependence of h on the various basic parameters of 
the mechanism and the human/mosquito interaction. Some of the basic 
parameters can be obtained by field observations or controlled laboratory 
experimentation. The basic parameters change with geographical location as 
well as seasonally. 

Some estimates for the parameter values are given in Bailey [3, Ch. 9] and 
references cited therein. 


9 Model IV (Dietz, Molineaux and Thomas, 1974) 

This model was developed in close co-operation with epidemiologists, 
entomologists and immunologists in connection with a field project in the 
district of Garki in Northern Nigeria [8]. The spread is modelled as a 
deterministic process but the system characterization is more detailed. The 
total population is divided into seven different groups, as shown in Fig. 14.1 
and briefly discussed in Section 3.1 (i). The rate of transfer between groups is 
related to some basic parameters, and as such the model incorporates a high 
degree of realism. 

9.1 Model Formulation 

We use the symbols defined in the original paper. The variables refer to the 
proportion of the human population in the seven different groups as a function 
of time. The variables defining the different groups are as follows. 

Xj(r): non-immune/negative 

x 2 (t ): non-immune/incubating 

yj(r): non-immune/positive infections 

y 2 (t): non-immune/positive - non-infectious 

x 3 (t): immune/negative 

x 4 (f): immune/incubating 

y 3 (t): immune/positive - non-infectious 
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The rates for transition between different groups are given as follows. 


h(t): 
Q(t ): 

a 2 : 

R { : 

R 2 : 


from “negative” to “incubating” for both non-immune and immune; 

from incubating to infectious in the case of non-immune and from 

incubating to non-infectious in the case of non-immune; 

from infectious to non-infectious in the non-immune; 

from non-immune non-infectious to immune non-infectious; 

from non-infectious positive to negative in non-immune; 

from non-infectious positive to negative in immune category. 


In addition, we define 


n : fixed incubation period; 
ty. extrinsic incubation period; 

6: rate of new births as well as death rate for all groups. 

Using a discrete time characterization with the discretization time interval 
being unity, we have, from simple balancing, the changes to the various 
proportions given by the following set of equations: 


Axj = <5 -1- R x y 2 -(h + S)x x 
Ax 2 = hx | — (?x,(f — n) — dx 2 
Ax 3 = R 2 y 3 -(h + S)x 3 
Ax 4 = hx 3 — Qx 3 (t — n) — Sx 4 
Ayi = QZi(t-n)-(xi-#)yi 
Ay 2 = a 1 y 1 ~(oc 2 + R l +S)y 2 

Ay 3 = «2.y 2 “ Q* 3(' - n) - (*2 + % 3 , 


(40) 


(Note: We have suppressed the argument t for most of the variables. Because 
of the incubation period there is a time lag of n. Hence some of the variables 
are the lagged variables (i.e. at time t — n) and these are shown explicitly.) 

We omit the details of the derivation which relate some of the rate 
functions to more basic parameters. The final form is given by 


(? = (1 —5)/i(f — h) 
h = g[\ -exp{ -Cit-^y^t-n)} 
Ri = h/{ exp (fc/r,) - 1}, 1 ^ i 2 


(41) 


where C(r) is a function which is called “vectorial capacity” and characterizes 
the relevant information of mosquito population and r, (1 ^ i ^ 2) represent¬ 
ing rate of elimination for a given brood. Defining </, to be probability of 
detecting the presence of parasite, then the proportion of population 
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detected to be positive is 

P( t) = Z WM < 42 ) 

i = 1 

and the true proportion which is positive is given by 

*W = i yM (43) 

i= 1 

If one assumes that q x = q 2 , then the model has 10 parameters given by 

0 = {5 t r 1 ,r 2 ,^ 1 ( = ^ 2 ),q 3 ,n,a 1 ,a 2 ,^y} (44) 

(/Vote: We need to specify the vectorial capacity C(t) using entomological 

considerations, and once this is done the model is completely specified by 
specifying the values for the seven variables at t = 0.) 

9.2 Model Analysis 

It is not possible to obtain an analytical solution to the set of difference 
equations given by (40). The form of the model makes it ideally suited for 
implementation on a digital computer; hence such solutions are easily 
obtained. 


9.3 Parameter Estimation 

Dietz et al. [8] fitted the model to two different populations with widely 
varying levels of vectorial capacity. They assumed values for seven para¬ 
meters based on entomological and biological considerations, and the values 
assumed are as follows: 


6 = 0.0001 
r 2 /fi = 10 

<?i(=<? 2 )= 1 

<73 = 0.7 
y = 10 days 
n = 15 days 
<*! = 0.002 


(45) 


The remaining three parameters were estimated using field data and are as 
follows. 


a 2 = 0.00019 ±0.0001 
$ = 0.097 ±0.017 
f x = 0.0023 ± 0.0005 


( 46 ) 
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Figure 14.3 is a reproduction of a figure given in Deitz et al. [8] which shows 
the comparison between the model output obtained using the above set of 
parameters and some observed field data. 


9.4 Validation 

With the restricted data available for estimation it is generally agreed by 
experts that the model is an adequate representation for the spread of 
malaria. However, the assumption that the mechanism for the spread is 
deterministic is not justifiable. Hence a stochastic model which overcomes 
this drawback would yield an even better model. We discuss one such. 


10 Model V (Bekessy, Molineaux and Storey, 1976) 

The models discussed so far (Models I-IV) are deterministic - i.e. the 
changes to x(r), the proportion of total population infected, are given by a 
deterministic formulation. In this section we discuss a model which 
characterizes the change from infected to uninfected or vice-versa by an 
uncertain chance mechanism. 
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10.1 Model Formulation 
Define N and X(f) as follows: 

N: total population (assumed fixed), 

X(t): number of individuals infected at time t. 

As a result we have 

N — X(t): number of susceptibles at time t. 

We assume that the probability of one new infection occurring in time t to 
t + At is given by h(N - X(t))bt and the probability of one recovery in the 
same time period by rX(t)At. 

10.2 Model Analysis 

Define pj(t ) and P{z,t) as follows: 

Pj = pTob{X(t)=j}0^j < N (47) 

and 

P(2,t)= X P;(0z j (48) 

j=o 

Then, using a method similar to that used in Section 7.1 for obtaining/^(r), 
we have p ; satisfy the following set of differential equations: 

dp 0 (t)/dt = - hNp 0 (t) -I- rp x (t) 

dpj(t)/dt = — {h(N —j) — rj } Pj (t) + h{N + 1} Pj- x (0 

+ r(j + l)p i+1 (0 ( 49 ) 

for 1 < j ^ N — 1, and 

dp N {t)/dt = - rN P N (t) + hp N - x (t) 
and P(z y t) satisfies the partial differential equation 

dP/dt = (1 - z)(r + hz)dP/dz + yh(z - 1 )P (50) 

If we assume that, at t = 0, the number of infected individuals is given by 

x(0) = a 

then 

p,(0)=l if j = a | 

= 0 otherwise] 

and 

i>(z,0) = z* 


(51) 

(52) 
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With the above initial condition, (50) can be solved analytically to yield 

P(z y t) = (r + h)' N {(r + hz) + r(r- l)e' <r+/,,r } fl - {(r + hr) 

-h(z- i) e -<' + (53) 

From this, we can obtain expressions for various moments of X(t). Let 

m(t)=ElX(t)-}/N (54) 

represent the mean value for the fraction of people infected at time t. Then, 
from (53) we have 

E[X(t)l = ldP/dz] :=l 

= { Nh/(r + /i)} {1 — e _(r+#,)f } (55) 

and substituting this in (54) yields 

m (t) = { h/(r + h) } {1 - e-*'^'} (56) 

This is identical to x(t) in Model I. Thus, this model can be viewed as a 
stochastic version of Ross’s model. 

10.3 Parameter Estimation 

In stochastic modelling, one can estimate the model parameters by various 
methods - e.g. method of moments; maximum-likelihood, etc. The deriv¬ 
ation of the maximum-likelihood estimator follows along the lines indicated 
in Section 5.3 and is omitted. 

10.4 Validation 

One way of validating is to evaluate the goodness-of-fit between the 
estimated mean of X(t) based on the data available and the mean of X(t) 
obtained from the model. This can be done in a manner similar to that 
discussed in Section 5.4. 

The main criticism of the model is that it treats individuals as being either 
infected or not. This is a gross over-simplification and hence the model is 
suspect. An extension to include different groups (as done in Model IV) 
would make the model more realistic. 


11 Conclusions 

In this chapter we have examined five different models for the population 
dynamics of malaria spread. The reason for the choice was to illustrate the 
iterative nature of modelling where one starts with the simplest model and 
improves it at each stage to produce a better one. Many other models have 
been developed and applied with varying degrees of success. Notes at the 
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end of the chapter give references to sources where more details of these can 
be found. 


Notes and References 

Most books on infectious diseases contain a chapter on malaria. See for 
example [1] and [2]. There is a large number of books dealing specifically 
with malaria (e.g. [3] and [4]). In particular, the book by Bailey deals with a 
variety of mathematical models that have been developed. We have 
examined only a few of these in this chapter. For further details of the models 
discussed in this chapter, see either [4] or [5]—[8]. For other models see 
Bailey [4] and the references cited therein. An excellent source for more 
information, including data about the spread of malaria, is [9]. There are 
many papers in the above journal which give field data from different parts of 
the world. 

The modelling of different control actions and their effect on the spread of 
malaria is still at an infancy stage and mainly of theoretical interest. A review 
of models for the control of infectious diseases until 1977 can be found in 
[10]. Since then a few more models have been developed, some being for the 
control of the spread of malaria, but in general they are highly over¬ 
simplified and inadequate for practical implementation (e.g. [11]). 
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CHAPTER 15 


Designing a Pneumatic Pump 


1 Introduction 

In an aquarium it is essential that there be a good supply of dissolved oxygen 
to maintain the health of the fish and other life inside the tank. This is often 
achieved by pumping air into the aquarium. The performance of the pump 
depends on its design, which in turn involves two phases. The first is a 
conceptual design in which a suitable scheme is developed. This involves 
identifying the different components and the coupling between components 
necessary for a solution. The second phase deals with the specification of 
components in terms of their size, characteristics and other functional 
parameters so that the pump performance meets some stated criterion. In this 
chapter we shall concentrate on the second phase. More specifically, we are 
interested in predicting the performance of a small pneumatic pump, suitable 
for domestic aquariums, as a function of the characteristics of its components! 
We build a sequence of models, of increasing complexity, to achieve that. 

In Section 2 we carry out the system characterization. We take a "‘white- 
box” approach for two reasons: firstly, the mechanics of pump operation are 
well understood and hence possible to describe explicitly; secondly, such an 
approach is essential to relate the performance of the system to the 
characteristics of individual components. The characterization, and model 
representations, are best done using bond-graphs. The bond-graph technique 
is a powerful generalized technique which is particularly useful in modelling 
dynamic systems that can be described in terms of energy flows and 
transformations. A brief introduction to bond-graphs is given in Section 3. In 
Section 4 the bond-graph technique is applied to building a sequence of 
models of the air pump. Throughout the sequence there is progressive 
refinement of the model with a critical analysis of the effect of each refinement. 
Since there are no performance data available (as the problem is a design 
problem and the pump is yet to be built) this critical evaluation can be viewed 
as “model validation”. In Section 5 we discuss the analysis of the final model to 
obtain the end-product viz. relating pump performance to component 
characteristics. We conclude with some general comments about the model¬ 
ling process in Section 6. 
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2 System Characterization 

A schematic representation of the conceptual scheme for a simple pneumatic 
pump is shown in Figure 15.1. The pump can be viewed as a system. The 
boundary (or interface) between the system and its environment is shown in 
Figure 15.2. The system receives electric power and air (at atmospheric 
pressure) from the environment, and in turn supplies air (at a pressure higher 
than the atmospheric pressure) to the aquarium, which is a part of the 
environment. The components of the system are: 

(1) electromagnetic coil 

(2) magnet 

(3) lever 

(4) bellows 

(5) inlet and outlet valves 

(6) pipes (or tubes) for flow of air. 

The pump operating mechanism is fairly simple. When an alternating 
voltage from the mains supply is applied across the coil, an alternating 


Coil 



Figurk 15.1 Schematic representation of a simple pneumatic pump. 



Figlrf. 15.2 System/environmcnl interface. 
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magnetic field is set up. This field interacts with the magnet to produce an 
oscillating motion (in a vertical plane) of the magnet. This motion is 
transmitted through the lever to produce a vertical motion of the bellows. 
When the bellows move up, air is drawn through the inlet valve (with outlet 
valve closed) and when it moves down, air is pumped through the outlet valve 
(with inlet valve closed). 

A better understanding of the mechanism is achieved by viewing the system 
in terms of the energy transformation. From this point of view the system can 
be described as follows. Electrical energy is transformed into mechanical 
energy, which in turn is transformed into pneumatic energy. Since the device 
operates continuously, energy flow, or power, is more appropriate to describe 
the system. Thus, electrical power (given by the product of voltage and current) 
is transformed into mechanical power (given by the product of force and 
velocity) which in turn is transformed into pneumatic power (given by the 
product of pressure and flow rate). 

The variables characterizing the components can be described using terms 
from different theories in physical sciences. The terms that will be used are: 
mass, damping coefficient, force, velocity from dynamics; resistor, inductance, 
voltage, current from electrical circuit theory; pressure and flow rate from fluid 
mechanics. The relationships between the variables are given in terms of the 
laws of these theories. 

The components or devices (listed earlier) can be divided into three types: 

(i) energy (or power) transforming devices, 

(ii) energy storage devices, and 

(iii) energy dissipation devices. 

Viewed from an energy (or power) transformation point of view, the three 
main components of the system are: electromagnetic actuator, lever and 
bellows. In the simplest characterization these can be viewed as couplers. A 
coupler is a device which transforms energy (or power) with a power conserving 
relationship. Both the input and output are characterized through two 
variables: one called “effort variable” and the other called “flow variable”. The 
product of the two input variables gives the power inflow and the product of 
the two output variables gives the power outflow. In a coupler the power 
output equals the power input, and the output flow and effort variables are 
related to the input flow and effort variables by a simple algebraic relationship. 

The effort and flow variables associated with electrical, mechanical and 
pneumatic power are as follows: 


Electrical 

Effort variable 

Flow variable 

Voltage 

(e) 

Current 

(0 

Mechanical 

Force 

</) 

Velocity 

to) 

Pneumatic 

Pressure 

</>) 

Flow rate 

to) 
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With this background we are in a position to characterize the three main 
components mentioned earlier. 

The electromagnetic actuator is a coupler which transforms electrical power 
into mechanical power. It can be represented schematically as shown below: 


e x 

H 

/, 

«i 

”1 



where e x represents the voltage across the coil; /, the current in the coil;/, the 
force acting on the magnet and i>, the velocity with which the magnet is 
moving. The relationship between the output variables and the input variables 
are given by 


v x =(\ IH)e x ) 

J 


( 1 ) 


where H is a constant. Note that the two gains (i.e. H and \/H) are reciprocals 
of each other and the product v x f x =e x i l% signifying power conservation. 

The lever is a coupler in which both the input and the output power are 
mechanical. A schematic representation of this is as shown below: 


fl 

L 


»2 


”3 


with the variables related by 

fe -OWll 

r 3 = Lv 2 J 


( 2 ) 


where L is the lever ratio. Note again that / 3 t> 3 = f 2 v 2 , signifying power 
conservation. 

Finally the bellows, viewed as a coupler, can be represented by a schematic 
diagram as shown below: 


h 

A 

P4 

”4 


<?4 


with 

/ > 4=0/>t)/4j (3) 

q 4 =Av 4 j 

where A is the cross-sectional area of the bellows, and again we have 
f 4 v 4 = </4/V 

The other components are listed below in terms of their relative importance. 
Each is identified as either an energy storage device (ES) or energy dissipation 
device (ED): 


(i) mass of magnet (ES), compliance of rubber bellows (ES); 

(ii) resistance of coil (ED), inductance of coil (ES); 
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(iii) internal damping in rubber bellows (ED); 

(iv) friction at lever pivot (ED); 

(v) air friction at inlet and outlet valve (ED); 

(vi) compressibility of air (ES). 

Each component needs to be characterized by an appropriate flow and an 
effort variable and a relationship between the two. We shall do this in the next 
section, using the bond-graph approach. 


3 Bond-Graphs: an Introduction 

Table 15.1 contains the main symbols for bond-graphs that will be used later 
in the chapter. A simple bond is represented by a straight line with the effort 


Table 15.1 Bond-graph Symbols 


Variables, power, causality 
e 

— simple bond; effort (e) on top and flow (/) on bottom 

—> half-arrow to indicate positive power flow 

-| causal stroke: effort to right, (low to left 

|- causal stroke: effort to left, flow to right 

j-— positive power flow and effort to left 


One-port elements 
— X 



X = R, L or C (electrical) 

= D, M , or K (mechanical) 
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variable on top and the flow variable below. A half-arrow indicates the 
direction of positive power flow and a cross-stroke indicates causality. 

One-port elements are of three types. The one-port resistor is an energy 
dissipating element and the relationship between effort and flow is given by 

e = Xf (4) 

where X represents impedance, for example electrical resistance (R) or 
mechanical damping (D). The one-port capacitor is an energy storing element 
with a relationship of the form 

e = (l/X)J/(f)df (5) 

where X here represents electrical capacitance (c) or spring stiffness (k). The 
one-port inductor is also an energy storing element with a relationship of the 
form 


e = Xdf(t)/dt (6) 

where X represents electrical inductance (L) or mass (M). 

The two-port elements that are of interest in many dynamic problems are 
the transformer and the gyrator. Both are power conserving and the 
relationship between the variables is as shown in Table 15.1. The three-port 
elements are also power conserving elements and are of two types - the 1- 
junction and the 0-junction. The relationship between the variables is as 
shown in Table 15.1. 

With this background we are now in a position to build models of the air 
pump using bond-graphs. 

4 Model Development 

In this section we develop a sequence of models starting from the simplest. 
4.1 Model 1 

A simple model is obtained by ignoring all energy storage and dissipation 
elements. The system is viewed as being made of three separate energy (or 
power) transforming elements. As a consequence the model is given by the 
schematic representation shown in Figure 15.3. We have discussed each of the 
three devices in Section 2. The inputs to the model are e t and /, and the outputs 
are p 0 and q 0 . The model is given by equations (1)—(3) and the following 


e, = e, 

H 

frfz 

1 

f 3=f« 

A 

P4= PO 

ii = i. 

< 

"m , 
< 

L 

v 3 s v 4 

M 

<U = do 


Figure 15.3 Schematic representation of Model 1. 
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and 


*.=*<■) 

«i = *i j 

fi=A] 

w»l 

Po = P* 1 
?0 = <?4 J 


(7) 

( 8 ) 
(9) 

( 10 ) 


From these we obtain the following relationship between the output and the 
input. 


q 0 =(AL/H) ei j 
ii = (AL/H)p 0 J 


( 11 ) 


The reason for writing (11) as indicated above is that in the design e, (the input 
voltage) and p 0 (the pressure at which air is to be delivered) are specified. The 
selection of system parameters (given by A, L and H) determine q 0 and t, 
uniquely. In other words, e, and p 0 are the independent variables (for design 
purposes) and q 0 and /, are the dependent variables. As no energy is stored or 
dissipated, we have 

Po<?o = e.'f (12) 


The main drawbacks of this model are as follows: 


(i) Since the model ignores all energy dissipative elements it is highly 
unrealistic. 

(ii) Because of (i), the variable q 0 (i,) changes instantaneously as e,(p 0 ) changes; 
this also is unrealistic. 

(iii) As p 0 increases, also increases. Thus, should a blockage occur on the 
output side, the current drawn (/',) approaches infinity. This follows 
because a blockage is equivalent to p 0 -* oo. This is not true, and hence the 
model is unacceptable. 

The bond-graph representation of this model is shown in Figure 15.4. This 
representation contains more information than the schematic representation 
shown in Figure 15.3; i.e. causal relationships (indicated by short vertical lines) 


e,C = e,) , v 

-—-^ GY |-——^ 


TF |-^ 


. Po< = P«> „ 

TF |-^ 


v,('v 2 ) ' v,= (v.) ' q 0 (=q 4 ) 

Figure 15.4 Bond-graph representation of Model 1. 
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and flow of power (indicated by half-arrow tips). The advantage of bond-graph 
representation becomes more noticeable as we improve on this simple model. 


4.2 Model 2 


This model includes two new features, which are the mass of the magnet and 
the compliance of the bellows. Both of these are energy storage elements. The 
effect of including the mass of magnet is that f 2 #/ 3 . This follows as the total 
force generated by the electromagnetic actuator must now equal the force to 
move the magnet (given by mass times rate of change of velocity) plus the force 
to operate the lever mechanism. As a consequence, instead of (8) we have* 

/. =f 2 + Mdy 2 /dt 
v 2 = v t 

From (1) we see that 



fi v i = />i - Mv 2 dv 2 /dt 

= f l v l -d/dt{Mv>/2} (14) 

This can be interpreted as follows: power generated to drive the lever and 
bellows ( f 2 v 2 ) is equal to the power generated by the electromagnetic actuator 
minus the power used in altering the kinetic energy stored in the mass of 
magnet. 

The effect of the compliance of the bellows is also similar and, as a 
consequence, instead of (9) we have 


/Wj-Kjujdf 

t > 4 = l > 3 


(15) 


where K is the compliance of the bellows. 

Thus the overall model is given by equations (1H3), (7), (10), (13) and (15) (i.e. 
it differs from the earlier model in the sense that (8) and (9) are replaced by (13) 
and (15) respectively). Defining: 

*2=/3-/4 (16) 

and on manipulating the above-mentioned set of equations, we have 

q 0 = (AL/H)e, (17) 

and 

ii = (M/H 2 ){d ei /dt)} + (\/H){Lx 2 + LAp 0 } (18) 

dx 2 /dt = (KL/H) ei (19) 


*We shall omit writing the argument i for notational convenience. Thus, we shall write v 2 
instead of t> 2 (0, and so on. 
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Equation (17) is the same as the first equation in (11). However, i, differs, as it is 
a function of both the derivative and the integral of e t . The drawbacks of this 
model are as follows. 

(i) q 0 changes instantaneously with changes in e,; 

(ii) if e,- changes suddenly (e.g. a step change), then i, assumes unbounded 
values. This does not happen in real life; hence this model is still inadequate 
and needs to be further modified. 


4.3 Model 3 


Model 2 was obtained from a system characterization which ignored all 
energy losses in the system. In contrast. Model 3 is obtained from a modified 
system characterization which includes a feature to account for one such loss. 
More specifically, it treats the coil (or the electromagnetic actuator) as having a 
non-zero resistance R. As a consequence energy is dissipated in the form of 
heat. The system characterization is otherwise the same as for the previous 
model. The bond-graph representation for this characterization is given in 
Figure 15.5. 

As a consequence equation (7) gets modified to 





( 20 ) 


and the model is given by this and (l)-(3), (10), (13) and (15). 

By manipulating these equations (and using x 2 as given by (16)) we have 


q 0 = ALv l (21) 

/ , = (<?,-He, )/K (22) 

with 

dvjdt = (1 /M){(H/R) ei - (H 2 /R) Vl - Lx 2 - LAp 0 } (23) 

and 


dx 2 /dt = ALv t (24) 

Thus this model overcomes all the drawbacks of Model 1. However, this is still 
not adequate as it ignores various other features - compressibility, air friction, 
etc. 
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Figure 15.5 Bond-graph representation of Model 3. 
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At this point it is worth noting that the complexity of the underlying 
mathematical formulation for both Models 2 and 3 is the same in that they 
each involve two coupled ordinary differential equations. This is in contrast to 
Model I, whose underlying formulation is algebraic. As we make the system 
characterization more detailed, i.e. include more features, the complexity of 
the underlying formulation used in the model also increases. 

4.4 Model 4 

The system characterization for this model differs from that for the previous 
model in that we include the effect of air compressibility. The effect of this is 
that we have one more energy storage element in the system characterization. 
As a consequence (10) needs to be modified, for the output flow rate q 0 *q*. 


The modified relationship is given by 

Po = P* | (25) 

q 0 = q<-Cdpjdt] 

Thus the complete model is given by (1)—(3), (13), (15), (20) and (25). 

On manipulating these equations, we have 

q 0 = ALv x — Cdpjdt (26) 

«, = (<?,- Hv l )/R l (27) 

with 

dx 2 /df = KLv x (28) 

dvjdt = (1 /M){(H/R)(e, - HvJ- LA Pa - Lx 2 ) (29) 


In this model, if q 0 is held constant, then p 4 must fluctuate with time as t>, 
fluctuates. Since p 0 = p 4 , this implies that both the output pressure and flow 
rate cannot be held constant. If the output pressure is held constant, then the 
output flow rate fluctuates and is given by 

q 0 — ALv x 

and i, is given by (27). 

4.5 Model 5 

The system characterization for this model includes losses due to air friction 
resistance and coil resistance. Hence it is an improvement over the previous 
model. Because of the air friction being included (25) gets modified to 

Po = P*~ RoQo ) 

<7o = <?4 - Cdpjdt j 

where R 0 is the resistance coefficient. 


( 30 ) 
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Thus the overall model is given by (I )-(3), (13), (15), (20) and (30). These can 
be simplified to yield 


qo = (p a - Po)/ R o 


(31) 

*i = (et — HvJ/Ri 


(32) 

with 

dp 4 /df = (MC){ALv x -(p 4 - Po)/R 0 ] 

(33) 

dx 2 /d t = KLv x 


(34) 

dv x /dt = (1/M ){(H/R)(e i -HvJ-LAp^- Lx 2 . 

(35) 

On comparison with Model 4 we see that the underlying formulation 
involves three coupled ordinary differential equations. 

4.6 Model 6 



The system characterization for this model is again an improvement over that 
for Model 5, as it includes two new features - damping in the bellows and 
inductance of the coil. The bond-graph representation is given in Figure 15.6. 
As a consequence (20) and (15) get modified to 

e , = e t -i x R- L 0 di x /dt 

<i = 'i 

and 


(36) 

f A = f 3 -K\v 3 dt-Bv 3 


(37) 

& 

II 



where L 0 is the inductance of the coil and B the damping coefficient. 

The complete model is given (l)-(3), (13), (30), (36) and (37). This set of 
equations can be rewritten as 

<?o = (P4 - Po)/Ko j 

'l=«l J 


(38) 

with 

dpJdt = (\/C){ALv x - (p 4 - p 0 )/R 0 } 

(39) 

dx 2 /dt = KLv x 


(40) 

dvjdt = (1 /M){Hi { - LAP* - Lx 2 

- L 2 Bv x ] 

(41) 

diJdt = (l/L 0 ){e i -Hv l -R h } 


(42) 


The underlying formulation is a set of four coupled first-order ordinary 
differential equations. 
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Table 15.2 Relationship between Mathematical Equations of Models I -6 


Model 

1 


2 


3 

4 


5 

6 

2 


2 


2 

2 


2 

2 

3 


3 


3 

3 


3 

3 

7 


7 

—* 

20 

20 


20 

—► 36 

8 

—• 

13 


13 

13 


13 

13 

9 

—♦ 

15 


15 

15 


15 

—♦ 37 

10 


10 


10 —► 

25 

—♦ 

30 

30 


4.7 Discussion 

Since each model is an improvement over the previous one. Models 1-5 are 
special cases of Model 6 . To illustrate this better, we present the underlying 
equations of the models in a tabular form in Table 15.2. The arrows indicate 
the changes resulting from the inclusion of one or more new features. 

Because of (4)-(6) the relationships between variables are all linear. This is 
justified only when changes in the variables are small. When these changes 
become large, the linear relationships of (4)-(6) are no longer valid and one 
needs to use non-linear relationships. As a result the complexity of the 
underlying formulations will increase. Finally, on comparing Model 1 with 
Model 6 , we see that the former uses a static formulation with only three 
parameters while the latter uses a dynamic one with four coupled first-order 
linear ordinary differential equations with ten parameters. We could improve 
further on the last model, but for design purposes it is adequate, for it 
incorporates all significant components of the system. 


5 Analysis 

Since the input voltage is the line voltage which is sinusoidal, and the output 
pressure p 0 a constant, the current drawn (/,) and the flow rate ( q 0 ) are also 
sinusoidal. The variables of interest are the amplitudes off, and q 0 as functions 
of model parameters and input line frequency co. 

We start by rewriting the equations in matrix form. 

Define x, = p A , x 3 = p,, and x 4 = Then (39)—(42) can be rewritten as a 
matrix differential equation of the form 

dX/dt = AX + Bu ( 43 ) 


where 


j* r = [x, x 2 x 3 x 4 ] 
1 “ T = l>, Pol 


( 44 ) 
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A = 


~(\/R 0 C) 

0 

(AL/C) 

0 

0 

0 

KL 

0 

-(LA/M) 

-L/M 

l 2 b/m 

H/M 

0 

0 

— H/L 0 

O 

a: 

1 

"0 \/R 0 C 





B = 


0 

0 

0 


(45) 


(46) 


The analysis can be done either in the time domain or in the frequency domain. 


5.1 Time Domain Analysis 

The equations given by (43) (46) can be solved either analytically or 
computationally to obtain the behaviour of q 0 (t) and /,(f) for specified 
parameter values. As t becomes large, q 0 (t) and i,(r) reach steady state values 
if the equations are stable. 

We shall consider the following nominal values for the ten parameters of the 
model: 

H = 13 Newtons/amp 
L = 0.58 

A = 1.71 x 10 _ 4 m 2 
M = 1. 11 x 1 0 3 Newton s 2 /'m 
K = 2250 ft 
L 0 = 3.9 H 

C = 18.65 x 10" 10 m 5 /Newton 
R 0 = 3.82 x 10* Newton-s/m 5 
B o = 0 (i.e. assumed negligible) 

K = 117 Newton/m 

The input voltage e, is sinusoidal with an amplitude of 220 volts and a 
frequency of 50 cycles s. The output pressure is assumed to be 100 kilo Pascals. 

The equation, given by (43) (46), was integrated from t = 0 to 10.0 with the 
initial conditions assumed zero at / = 0 . Since the equations are fairly stiff, the 
integration was done using Gear's variable-step package from the NAG 
algorithm library. The solution was computed to a relative accuracy of 0.0001 
and the computations done on VAX using double precision. The output q 0 
and the current drawn /, are show n in Figure 15.7 for the time interval [7.93, 
8 . 00 ]. Note that the solutions have not yet reached steady state as the 
amplitudes are still fluctuating. 
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5.2 Frequency Domain Analysis 

Towards this end, let X(s) and u(s) denote the Laplace transform of X(t) and 
u(t). From (43) we have 

X(s) = (sl-A)-'Bu(s) (47) 

Since p 4 (r) = x,(0, we have 

/> 4(0 = *,(f) = CX(f) (48) 

with 

C r = [l 0 0 0] (49) 

From (38), we have 

q 0 (s)={C T X(s)-p 0 (s)}/R 0 (50) 

where q 0 (s) and p 0 (s) are the Laplace transforms of q 0 (t) and p 0 (t ) respectively. 
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After some simplification, we can re-write (50) as 

1 JV,(s> , 1 f N,(s) ) 

<lo S) ~ RjT 0 D ,(i) + R~ 0 {(RoQDJs) ~ 1 j Ms) <51) 

where 

N { (s) = sa l3 a 34 (52) 

N 2 (s) = s 3 + s 2 { -a 33 - a 44 } + s{a 3i a 44 + a 34 tf 43 - ^ 23 ^ 32 } + ^ 23^44 (53) 
D(s) = s 4 + s 3 { -a xl - a 33 - a 44 } + s 2 {a } { a 33 + a l{ a 44 

~ a 32 a 23 + 13^31 } + S { ~ a \\ a 33 a 44 ~ a \\ a 34 a 43 

+ a 23 a 32<*44 ~ <*44ai3<*3 1 } + { “ <> 11 * 23 ^ 2 * 44 } (54) 

(Note: a u ( 1 ^ i, j ^ 4) are the elements of matrix >4.) 


Using s = jw with vv = 2 tt/, we can study the behaviour of N^/D^s) as / 
varies. Since N^syD^s) is complex, it consists of a real part and an 
imaginary part. Defining GAIN as the magnitude of this complex quantity, we 
have GAIN as a function of frequency /. Good design requires that GAIN be 
maximum at / = 50, implying that the pump develops maximum output flow 
rate at / = 50. The plot of GAIN versus /, for the nominal values mentioned 
earlier, is shown by curve A in Figure 15.8. Note that GAIN is maximum at 
/ = 27. 

By changing the lever ratio from 0.58 to 0.68, GAIN was re-evaluated as a 
function of /. The plot is shown by curve B in Figure 15.8. Note, the peak of 
GAIN has shifted from 27 to 35. On the other hand, if A is doubled, then the 



Figure 15.8 Plot of gain vs frequency. 
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plot of GAIN versus / is as shown by curve C in Figure 15.8. In this case, the 
peak has increased but it occurs at the old frequency. 

This type of analysis is useful in selecting the components of the system so 
that it yields the specified value for GAIN at the specified frequency. 

The analyses indicate that the nominal values selected for the parameter of 
the model (or equivalently, the characteristics of the different components) are 
not the best, and that these would need to be altered to achieve a peak for 
GAIN at / = 50. 

6 Conclusions 

In this chapter we have discussed a design problem where the mathematical 
model plays an important role in the engineering task of designing. We could 
have transformed the problem to an optimization problem by defining the 
optimization criterion as the maximization of GAIN at / = 50. Viewed this 
way we have a constrained static optimization problem, since GAIN is related 
to the parameters by an algebraic equation given by (51). However, our 
aim has been to illustrate the iterative nature of model building: starting 
from the simplest model and increasing the complexity till an adequate model 
of least complexity is obtained. 

Notes and References 

Our presentation of the bond-graph approach to the modelling of electrical, 
mechanical and pneumatic systems has been very terse. For further details, see 

[l]-[3]- 

The six models presented are due to Martens and Bell [4]. They give the 
bond-graph representation for all the six models. 

The variables x x , x 2 , x 3 and x 4 in the final model have physical dimensions. 
By scaling them properly we can rewrite (43) in a non-dimensional form and 
overcome the stiffness of the equations. 

The analysis of linear models of the form given by (43) can be found in most 
books on linear system theory. See, for example, [5]. The evaluation of GAIN 
was done using packages from [ 6 ]. 
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CHAPTER 16 


Forecasting Airline 
Passenger Growth 


1 Introduction 

Growth in the number of airline passengers is of great interest to airline 
operators and to planners responsible for developing airports. The importance 
of forecasts to airline operators is two-fold. In the short term it allows for more 
efficient scheduling, and in the long term it guides the expensive planning 
decisions regarding purchase of new planes. If decisions are based on poor 
forecasts, the economic viability of the airline operator is at stake. 

In this chapter we look at the problem of building mathematical models to 
produce forecasts of growth in the number of airline passengers. We 
commence with a system characterization relevant for model building in 
Section 2. A comprehensive model incorporating all the relevant variables is of 
limited use due to problems associated with the data needed for estimation and 
validation; instead we shall use the "black-box" approach. The advantage of 
this is that the model is built and validated using only the available data 
(namely, the monthly figures for airline passengers). The approach to 
modelling is discussed in Section 3, where we also give the data that will be 
used for model building and validation. In Section 4 we carry out some 
preliminary analysis of the data to decide on the most suitable mathematical 
formulation for modelling. Since the modelling is highly data-dependent we 
briefly discuss this aspect in Section 5. In Section 6 we examine the problem of 
estimating model parameters. In Section 7 we present a sequence of models 
leading to an adequate one. In Section 8 we discuss some extensions and 
possible refinements. 


2 System Characterization 

In this section we examine the relevant variables which affect the growth in the 
number of airline passengers and other factors such as time interval for 
aggregation and the appropriate framework for modelling. 
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2.1 Variables and Relationships 

Since we are interested in aggregated numbers, let N t represent the number of 
passengers flying in the interval t = /A to (/ -F 1 )A where A is specified. We shall 
discuss the choice of A later on. /V, could represent the number of passengers 

(i) travelling within a certain geographical region; or 

(ii) travelling by a certain airline carrier; or 

(iii) travelling within and moving out of a certain geographical area. 

What /V, represents depends on the need of the forecaster. For this case study 
/V, represents the number of airline passengers either moving into or out of the 
United Kingdom (U.K.). 

/V, depends on many other variables. Some of the important ones are: 

(i) cost of travel (e.g. average cost/km); 

(ii) the quality of service relative to other modes of travel; 

(iii) advertising; 

(iv) the general state of the national economy. 

Some of these in turn might depend on other variables - for example, the 
price of aviation fuel and the financial state of the airline operator. 

The relationships between the variables can be fairly involved. As an 
example, the number of passengers carried depends on advertising, which in 
turn depends on the profitability of the airline; finally, the profitability in turn 
depends on the number of passengers carried. The dependence involves 
varying time lags which are difficult to quantify. Similarly, it is difficult to 
quantify certain variables - for example, quality of service - in a meaningful 
way. 

This suggests that a "black-box" approach is the most appropriate, and we 
discuss this in the next section. 

2.2 Interval for Aggregation 

Since the forecasts of interest are the aggregated values, the choice of the time 
interval over which the number of passengers is to be aggregated is important. 
We represented this interval by the symbol A in our earlier characterization. 

If A is large, then we have a coarse description, since the aggregation is done 
over a long interval. As A gets smaller the description becomes finer. The 
choice of A is part of the art aspect of modelling. We shall choose A to equal 1 
month for the following two reasons. 

(i) statistics for passengers carried are usually compiled on a monthly basis; 
and 

(ii) smaller A (say 1 week) is too fine and of limited use. 
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For long-term planning one might choose A equal to 3 months or 6 months, or 
even a year. 


2.3 Deterministic vs Stochastic 

Since N t is affected by many variables, some of which change in an uncertain 
manner, the most appropriate framework for characterizing is the 
stochastic framework. The modelling of N, involves a stochastic formulation 
with uncertainty characterized explicitly. The variables representing un¬ 
certainty can also be viewed as representing the cumulative effect of some of 
the relevant variables ignored in the modelling. 


3 The Black-box Approach 

The “black-box” approach to modelling that we will follow is to characterize 
N i solely in terms of its previous values (i.e. j Js I) and through an 
uncertain (or random) sequence Wj (;'</). The model is given by a 
mathematical formulation which relates N t to its previous values and the W 
(j ^ /) sequence. The formulation is selected on an ad-hoc basis and revised 
based on the fit between the model output and the historical data available. 
Thus, the modelling can be viewed as a “ data-dependent" one. The precise 
details of the mathematical formulation that we will be using will be discussed 
in Section 5, after we have examined the data available for modelling and 
estimation. The modelling approach that we will be taking is called “stochastic 
time series modelling" and often the word stochastic is ignored and the term 
used is “time series modelling". This formulation was discussed in some detail 
in Section 7.3 of Chapter 7. 

The data we will be using for modelling are given in Table 16.1. This shows 
the monthly aggregates for the number of airline passengers for the U.K.. 
obtained from the monthly statistics published by the government. The data 
cover the period January 1966 February 1985, and we regard January of 1965 
to correspond to /= 1. As a result the data available for modelling and 
validation are given by w„ !</«$/ with / = 230.* 

We are interested in obtaining forecasts for the period March 1985 
December 1988; in other words, obtaining estimates of /V, for 231 $ / < 264. f 


•We follow the standard convention for stochastic variables - i.e. upper-case letters represent the 
variables and lower-case letters represent the realized values. Thus N, represents the random 
variable and n i the observed (or realized) value. 

f Rather than model N i% we model N, scaled by a factor of 10 6 . We shall henceforth use N, to 
represent this scaled variable. As a result we would need to scale model forecasts and variances by 
10 and 10’*. respectively, to obtain the true values. 
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4 Analysis of the Historical Data 

A plot of the historical data of Table 16.1 is given in Figure 16.1, and reveals the 
following: 

(i) the data can be viewed as consisting of a deterministic component and a 
stochastic component; 

(ii) the deterministic component consists of a trend (or growth) component 
and a seasonal component; 


Table 16.1 Historical Data for Airline Passenger Growth (I0 6 ) 


Year 



Data 




1966 

0.6090 

0.5496 

0.7225 

0.9616 

1.0948 

1.4076 


1.6092 

1.5689 

1.3546 

0.8862 

0.6105 

0.6833 

1967 

0.6519 

0.5878 

0.8539 

0.9253 

1.1213 

1.3201 


1.5517 

1.5968 

1.4496 

0.9690 

0.6335 

0.6575 

1968 

0.6402 

0.6024 

0.7942 

0.9939 

1.0515 

1.2692 


1.4888 

1.5567 

1.4016 

0.9563 

0.7001 

0.7687 

1969 

0.7121 

0.6265 

0.8720 

1.0125 

1.2084 

1.3536 


1.5669 

1.6417 

1.4698 

1.0885 

0.8142 

0.8492 

1970 

0.7866 

0.7323 

1.0057 

1.0957 

1.2596 

1.3408 


1.5847 

1.6539 

1.5317 

1.1299 

0.8467 

0.8991 

1971 

0.7543 

0.7643 

0.9865 

1.2122 

1.2937 

1.4813 


1.6761 

1.6916 

1.6115 

1.2276 

0.8250 

0.9342 

1972 

0.8970 

0.7923 

1.0760 

1.2495 

1.4566 

1.5581 


1.7049 

1.7390 

1.7785 

1.4007 

1.0239 

1.0904 

1973 

1.0214 

0.9155 

1.2193 

1.4605 

1.5469 

1.6878 


1.8305 

1.8793 

1.8930 

1.4976 

1.2114 

1.2011 

1974 

1.1066 

0.9873 

1.1860 

1.3993 

1.3434 

1.5904 


1.7031 

1.7424 

1.7901 

1.3910 

1.0667 

1.0895 

1975 

1.0342 

0.8976 

1.1930 

1.2234 

1.3728 

1.4647 


1.7272 

1.8009 

1.7546 

1.5015 

1.1217 

1.1409 

1976 

1.1119 

1.0150 

1.2966 

1.4164 

1.4916 

1.6686 


1.7734 

1.8018 

1.8135 

1.5299 

1.2304 

1.2366 

1977 

1.7337 

1.1013 

1.4106 

1.1071 

1.5193 

1.7188 


1.8541 

1.6884 

1.5820 

1.3719 

1.2766 

1.2795 

1978 

1.2490 

1.1498 

1.5474 

1.5851 

1.7619 

1.9041 


2.1154 

2.1080 

2.0637 

1.8177 

1.5152 

1.4168 

1979 

1.2628 

1.2367 

1.5905 

1.8749 

1.9859 

2.1109 


2.3573 

2.4409 

2.2805 

2.0581 

1.5985 

1.5130 

1980 

1.4713 

1.4042 

1.7318 

1.7910 

1.8944 

2.0760 


2.2716 

2.3677 

2.1611 

1.9428 

1.5413 

1.5378 

1981 

1.4905 

1.3706 

1.6129 

1.8020 

1.7615 

1.8558 


2.1356 

2.2615 

2.1572 

1.9283 

1.5705 

1.4140 

1982 

1.4971 

1.3207 

1.5942 

1.7416 

1.7356 

1.8747 


2.1364 

2.0586 

1.9945 

1.7847 

1.4396 

1.4304 

1983 

1.3962 

1.2560 

1.6144 

1.6364 

1.7379 

1.8764 


2.0450 

2.0074 

2.0417 

1.8177 

1.4822 

1.4733 

1984 

1.4428 

1.3594 

1.6228 

1.8185 

1.9519 

2.1055 

1985 

2.2425 

1.6115 

22569 

1.4765 

2.2878 

2.0798 

1.7389 

1.6506 


Source. Reference [1] 
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(iii) the trend component is linear in time and the seasonal component 
exhibits a periodic pattern with a period of 12 months. 

The reasons for the linear trend are: population growth, air travel becoming 
relatively cheaper than other modes of travel, and the increase in the general 
standard of living which allows more time for leisure. The periodic pattern is 
due to seasonal factors - passenger numbers increasing in the summer 
months, which coincide with the holiday season as well as being the best time 
to travel. The stochastic component is due to various uncertainties which 
cause random variations. 

Thus, Ni can be viewed as comprising the following components: 

Ni-Dt + Pt+Wt ( 1 ) 

where D, and P , represent the deterministic trend and periodic components, 
and W { the stochastic component. 

If we assume that the deterministic trend is linear, D, is of the form 

Di = a + bi ( 2 ) 

By differencing, we have 

Di ~ A -1 =b (3) 

and 

(D t — i) — (D,_ , — D,_ 2 ) = 0 (4) 

i.e. the trend component in N { can be eliminated. 

Motivated by the need for a compact notation, we define two operators - 
the backward difference operator and the backward shift operator. The /cth- 
order backward difference operator V*, 1, is defined as 


V*Af| = N,-Ar,-* 


(5) 
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and the kth-order backward shift operator B k , 1, is defined as 

B k N ( = Ni- k (6) 

When/c = 1 we use V and B instead ofVj and B,,and the two are related by the 
relation 


V=1 -B (7) 

As a result (3) and (4) can be rewritten as 

VD, = b (8) 

and 

V 2 D, = 0 (9) 

where V' is the operator V applied j times. 

Since the periodic component has a period of 12 months, if we assume that 
P ( is of the form 

P ( = A sin (7ri/6 + Q) (10) 

where A and Q are constants, then taking a twelfth-order difference we have 

V l2 P| = (l-fli 2 )P,-0 (II) 

Thus the periodic component can be removed by such a differencing. 

With this background we now define two new time series - F, and Z, - 
which are as follows: 


K f = VN ( , 2 

(12) 

Z, = V l2 Y h 14 

(13) 


Note that in terms of the backward shift operator (12) and (13) can be written 

as 

Y ( = (\-B)N h 2 (14) 

and 

Z t = (l-m-B l 2 )N h 14 (15) 

We can view T, as the time series obtained by the removal of the linear trend 
from N h and Z, as the time series obtained by the removal of the linear trend 
and the periodic component. The plots of y, and z, are given in Figures 16.2 
and 16.3, respectively. Note that z f is fluctuating about a mean value z = 
-0.00131, obtained by the expression 

230 

Z= I Zj/216 (16) 

1=14 

Since z is very close to zero, we can ignore it. We assume that the Z, series is 
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Figure 16.3 Plot of 2 , and 1 (trend and seasonality removed). 

weakly stationary and ergodic.* As a result, estimates of the autocorrelations 
of Z, can be obtained from the historical values of z„ 1 ^ 1. Let v(k) represent 
the autocovariance of lag k , i.e. 

v(*) = E[Z, Z i+k ] (17) 

Then, v(k), an estimate of v(k), can be obtained by the relation 

230-fc 

m= I (Z(+* — 2)(Z( — z)/(216 — k) (18) 

i- 14 

Rather than v(/c), we shall deal with the autocorrelation p(k ) given by 

p(/c) = v(*)/v( 0) (19) 

•Weakly stationary implies that the mean £{Z,} does not change with 1 and the autocovariance 
E{Z { Zi + *} is a function of only k. Ergodicity implies that the ensemble averages can be obtained 
by time averaging - e.g. 

N 

E{z ( } * lim £ ZJN , and so on. 

N-cc 1 = 1 
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Figure 16.4 Plot of p(k) vs k . 


Let p(/c) represent the estimated value of p(k). A plot of p(k ), 0 ^ k ^ 25, is given 
in Figure 16.4. Note that p(k) is significantly different from zero for k > 0, 
indicating that Z is a correlated sequence. (Since p(k) = p(- k), we shall 
confine our discussion to k > 0.) 

The above statement, that Z, is a correlated sequence, is based on visual 
assessment. A rigorous procedure is needed to establish this fact. This can be 
done by a hypothesis-testing procedure discussed in the next section. We omit 
details of the tests; these can be found in the reference cited in the Notes at the 
end of the chapter. 


4.1 A Test for Serial Correlation 

Suppose that Z, is an uncorrelated Gaussian sequence with zero mean and 
finite variance. Let p(/c), 1 < k ^ K be the sample autocorrelation obtained 
using the available data z h 14^/^216. Then, the variable STA given by 

STA = 216 £ & 2 (k) (20) 

*=» 1 

is distributed according to a x 2 distribution with K degrees of freedom. 

We can use the above result to test the null hypothesis that Z, is an 
uncorrelated sequence. Using the values of p(k) given in Figure 16.4, we have 

STA = 135.583 (21) 

and K = 25. From the tables of x 2 distribution with 25 degrees of freedom, the 
null hypothesis, that Z, is an uncorrelated Gaussian sequence with zero and 
finite variance, can be accepted as being true with 95 per cent confidence only if 
STA is less than 37.7. Since the computed STA is greater than 37.7 we must 
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reject the null hypothesis. In other words we must accept the alternative 
hypothesis that Z, is a correlated sequence. 

5 Modelling Methodology 

We first build a stochastic model to describe the changes in Z, using the 
historical data available. Once this is done we obtain forecasts Z, of Z, for 
* > 230 from the model. Using this in (15) we then obtain the forecast N ( 
for 230</<264. Note that the forecasts cover the period March 1985 to 
December 1987. 

5.1 Modelling the Z Sequence 

Since we have assumed that the Z, sequence is a Gaussian ergodic process, the 
mathematical formulation most suited for modelling Z, is an autoregressive 
moving average (ARMA) formulation. We discuss two different structures for 
the model formulation. 

Structure I 

The structure of the model formulation is given by the relation 

z l +t»jz t -j=y i +ip j y l - j ( 22 ) 

;=i ;=i 

subject to the following conditions: 

(Cl) / and m are non-negative integers with at least one of them being non¬ 
zero; 

(C2) the roots of the following two polynomial equations 

Q,(x) = x' + £ <Xjx'- J = 0 
and 

QiM = x m + £ PjX m ~ J = 0 (23) 

j=l 

all lie strictly inside the unit circle. 

(C3) V { is a sequence of independent and identically distributed Gaussian 
random variables with zero mean and variance a 1 . 

Some comments are appropriate at this stage. 

(i) If both / and m are zero, then Z, is an independent sequence and hence not 
appropriate for modelling. 

(ii) The root condition ensures that Z,- is stationary and ergodic. 
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(iii) Since the formulation is linear, and V { a Gaussian sequence, Z, is also 
a Gaussian sequence. 

(iv) The parameters are given by the set 

0 = {/, m\ a 7 ,1 *£y ^ /; 1 *Sy ^ w; o 2 } (24) 

i.e. the total number of unknown parameters is (/ + m + 3). The parameters are 
constrained - that is, a ; , 1 ^ / and (lj , 1 ^y ^ m must ensure that the root 

condition (C2) is satisfied; / and m satisfy condition (Cl), and o 2 by definition 
must be positive. 

Structure II 

This is a slightly more general formulation to model Z, and is given by the 
following relation: 

Z l +£a J Z l - J .-W,+ f i bjW l _ M (25) 

J m * 7 = 1 

with W { given by 

w i+ t *jW k -j = K + IPjK-j (26) 

7=1 7=1 

subject to the following conditions: 

(Cl') /,w,/' and m' being non-negative integers with either / or m being 
positive; 

(C2') the roots of the polynomial equation given by (23) and of the following 
equations 

Qi(x) = x r '+ X a J x il '-* = 0 

7=1 

and 

e 4 (x) = x"' 1 + £ bjX (m '~ JU = 0 (27) 

7=1 

all lie inside the unit circle. 

(C3') Same as in (C3). 

This model formulation contains more parameters than the model formula¬ 
tion discussed earlier. The additional parameters are given by the set 

0) = {s; /', m'\ a p 1 s£y ^ /'; b j9 1 <y ^ m'} (28) 

The advantage of this formulation is that it allows for correlation at lags which 
are multiples ofs. Note that by setting /' and m' to zero this formulation reduces 
to the earlier formulation. We assume s= 12. 
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5.2 Modelling Procedure 

We assume that the model formulation is given by (25) and (26) subject to 
constraints (C1')-(C3')- We divide the model parameters into two sets as 
follows: 

0, ={/,m,/\m'} (29) 

and 

<=>2 = {a;J and a 2 } (30) 

The first set characterizes the order of the model formulation. The model 
complexity increases as the values assigned to the parameters of this set 
increase. The second set depends on the first set and represents the coefficients 
of the model formulation and the variance of the random input. 


Initial model 
specification 


Step -1 
Para est 


Step-2 

Forecasts 


Step-3 
Validation 


Step-5 

Forecasts 



Figure 16.5 Model building procedure. 
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The modelling is done in an iterative manner shown in Figure 16.5. 
Essentially it consists of the following four steps. 

Step 1: Estimate the parameters of set © 2 . The estimation is done using z h 
14 < / ^ 200. The details of the estimation procedure will be discussed in the 
next subsection. 

Step 2: Using the estimated values, forecasts Z, are obtained for 
201 ^ i ^ 230. This is done by using (25) and (26) and assuming zero values 
for V it 201 ^ i < 230. 

Step 3: The forecast values Z, are compared with the historical data z f for 
201 ^ i ^ 230. A validation criterion is used to decide whether the model 
under consideration is adequate or not. We discuss this aspect in some detail 
in the next section. 

Step 4: If the validation test in Step 3 implies inadequacy of the model, the 
model is altered by changing the values assigned to one or more of the 
parameters of set and the procedure repeated. 

Step 5 : This step is executed only after the validity of the model has been 
established, and is done in a manner similar to Step 3 to obtain the forecasts 
for the period 231 ^ i ^ 265. 

The iterative procedure is initiated by an initial specification for the 
parameters of set 0^ We start with the simplest specification: 

/' = m! = m = 0 and / = 1 

We first consider only varying / and m (in other words. Structure I model 
formulation) and later varying either /' and/or m' from zero (in other words, 
Structure II model formulation). The reason for this is that our aim is to build 
an adequate model of least complexity. 

We use the time series packages from the NAG library to execute all the 
steps, except Step 4. Step 4 is done manually, using a simple strategy of altering 
only one parameter of the set Q { at each stage of the iteration. We omit the 
details of the program, as discussing it will cause a significant digression from 
the main theme: that of building an adequate model. Further discussion of the 
NAG package can be found in the Notes at the end of the chapter. 

It is worth noting that we have divided the available data z„ 1 < i < 230, into 
two parts. The first part, comprising 1 < / < 200 (or approximately 16 years), is 
used for parameter estimation and the second part, comprising 201 ^ i ^ 230 
(or 2\ years), is used for model validation. This is done to ensure that the data 
used for validation are different from these used for parameter estimation. The 
breakpoint in the data (which corresponds to i = 200) is arbitrary. 

5.3 Parameter Estimation 

The parameter estimation in Step 1 can be done using more than one method. 
The package in the NAG library estimates the parameters by the method of 
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maximum-likelihood. As discussed in Chapter 10, Section 4, this method 
provides statistically the most efficient estimates of parameters. In addition, 
the package provides extra statistical results useful for the purpose of 
validation. 

The maximum-likelihood estimates of the parameters of the set 0 2 (except 
o 2 ) are obtained by minimizing J given by 

200 

j= X vf (31) 

I = 14 

where £,,14 ^ i ^ 200, are given by 

m / 

+ X PA-j = **’, + X a A-j (32) 

j =i j =i 

and w h 14^/^ 200, are given by 

m' V 

*.+ X bjW i _ sj = z i + X OjZi-j, (33) 

j =i j =i 

with s = 12. 

The computer package starts with some nominal value for the parameters of 
the set © 2 . For a specified parameter value it computes J using (31) by solving 
for Vj using (32) and (33). In other words, it computes J as a function of the 
parameters being estimated. Using an iterative optimization scheme it 
computes the optimal value of the parameters which minimize J , at the same 
time ensuring that the root constraints are not violated. The optimizing values 
are the maximum likelihood estimates. The estimate <x 2 of a 2 is obtained by the 
relation 

<r 2 = J/186 (34) 

where J is evaluated using the maximum-likelihood estimates. 

6 Model Validation 

To establish the adequacy of a given model (in other words, to validate a given 
model) one needs to define a suitable criterion or test. Validity of the model 
can be established by 

(i) checking the validity of the underlying assumptions; 

(ii) goodness-of-fit between the model and the actual data; 

(iii) combination of (i) and (ii). 

In this section we will discuss (i) and (ii). 

6.1 Validation Based on Model Assumption 

Since we assumed that Vj is an uncorrelated Gaussian sequence, if the model is 
an adequate representation then the sequence 6 it 14 ^ ^ 200, can be viewed as 
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estimates of V t . Thus the validity of the model can be established by checking 
for the lack of correlation in the 0, sequence. 

Towards this end, we define the following statistics: 

STA = 186 X p 2 (k) (35) 

k = 1 

where p(k) is the estimated autocorrelation coefficient of lag k,\^k^K, using 
. the sequence v b 14 ^ i ^ 200. If v ( is a realization of an independent sequence, 
then STA is approximately distributed according to a x 2 distribution with 
{K — (/ + m + /' + m')} degrees of freedom (see Box and Jenkins [2], p. 291). 
Thus, by choosing a sufficiently large K , we can devise a test for validating the 
model (or establishing adequacy) as follows. 

Compute STA using (35) and test whether the computed value is below a 
number L obtained from the tables for x 2 distribution with {K - (/ + m + /' 
+ m’)} degrees of freedom for a specified confidence limit. If STA is bigger than 
L, then we reject the model as being inadequate. If not (i.e. STA is less than L), 
then we can either accept the model as being adequate or subject the model to 
further testing. We will use K = 25 and 95 per cent confidence level for testing. 

An alternative test for model adequacy is as follows. If r, is a realization of an 
independent sequence with zero mean, then p r (fc) estimated using 186 data 
points is distributed (approximately) according to a Gaussian distribution with 
zero mean and variance roughly equal to 1/186. Hence, in this case p(k) must 
lie within ± 3/^/186 if the assumptions are valid. Thus, we accept a model as 
being adequate if p v (k\ 1 ^ k ^ K (K = 25) lie inside this band; otherwise we 
reject the model. 

6.2 Validation Based on Forecasting 

Since the aim of the model is to forecast, a suitable test is one which compares 
the goodness-of-fit between the model prediction and the historical data. This 
is possible in our case, since we have data z„ 201 ^ ^ 230, which we set aside 
for this purpose. 

The forecast, or predictive, ability can be defined through various measures. 
One which we shall use is the sum of error squares given by 

230 

ESQR = X (Zi-zf (36) 

i = 201 

where Z, is the forecast values obtained from the model. 

A test which we will use for deciding the validity of a model is as follows. 
Accept the model as an adequate representation if ESQR/30^<5, otherwise 
reject the model. Since the model is stationary, <5 > o\ where c\ is the variance 
of Z, For our test we shall use S = go 2 , where g is a positive constant greater 
than one. Since o\ is unknown, we use an estimate obtained from the historical 
data. This yields a value 0.01512 for a 2 . We shall choose# = 3. Note that as we 
make g smaller (bigger) the test becomes more (less) stringent. 
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The methodology discussed in Section 5 was used to build and test a variety of 
models with /' = m' = 0 (i.e. Structure I formulations) for all possible combin¬ 
ations of / and m with (/ + w)= 1,2,3,4,5 and 6. Table 16.2 shows the 
computer output for a sample of the models tested. It gives the order of / and m; 
the statistics of the residuals - viz. a 2 , STA and the degrees of freedom 
(25-/'-m'-/-m); the statistics of the forecast performance ESQR; and finally, L 
obtained from the tables of * 2 distribution for the purposes of validation. For 
all the models tested, STA is greater than L; hence no model can be accepted as 
being adequate. The reason for this can be seen from Figure 16.4; it shows that 
p 2 (k) is significant for k = 12 and 24. In other words there are correlations of 
lag 12, the period for the seasonality in the data. This suggests that one should 
use models involving Structure II formulation as opposed to Structure I 
formulation. 

A variety of models were built and tested for all possible combinations of 
I' + m' = 1 and 2 and / + m = 1,2,3,4,5 and 6. For some combinations the 
package failed to yield estimates, due to its inability to find a stationary 
model to fit the data. These combinations were ignored. Table 16.3 shows 
the computer output for a sample of the successful models tested. Models 
M03001 and M04001 yielded the best results. For both models, STA is less 
than L as well as ESQR/30 being less than 3<x 2 , indicating that we accept 
both the models as being adequate. Note that model M03001 has a slightly 
smaller value for STA and a slightly bigger value for ESQR when compared 
with the corresponding statistics for model M04001. Since the differences 


Table 16.2 Computer Results for Model Structure I 


Model 

/ 

m 

STA 

D.F. 

(X 2 

ESQR 

L 

M01000 

1 

0 

93.9812 

24 

0.01288 

4.40985 

36.4 

M02000 

2 

0 

92.2716 

23 

0.01284 

4.31495 

35.2 

M03000 

3 

0 

49.1419 

22 

0.01095 

2.87651 

33.9 

M04000 

4 

0 

44.3280 

21 

0.01071 

2.54135 

32.7 

M05000 

5 

0 

39.8483 

20 

0.01048 

3.04159 

31.4 

MO3I00 

3 

1 

46.9353 

21 

0.00977 

3.65098 

32.7 


Table 16.3 Computer Results for Model Structure II 


Model 

/ 

m 

r 

m 

STA 

D.F. 

(X 2 

ESQR 

L 

MO1001 

1 

0 

0 

1 

49.2002 

23 

0.00806 

1.15245 

35.2 

M03001 

3 

0 

0 

1 

29.2600 

21 

0.00799 

1.19957 

32.7 

M04001 

4 

0 

0 

1 

31.3748 

20 

0.00788 

1.03444 

31.4 

M04010 

4 

0 

1 

0 

44.5310 

20 

0.00957 

3.22186 

31.4 
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are not significant, we accept model M3001 for the simple reason that it has 
one less parameter compared to model M4001. Thus the final model is given 
by (25) and (26) with 

/'= 0, m' = 1, / = 3 and m = 0 
and the following values for the parameters. 

а, = -0.3337, a 2 =-0.1041, a 3 =-0.3344, 

б, =0.7826 and a 2 = 0.0080. 

The model forecasts for the period 201^/^230 are plotted in 
Figure 16.6, along with the historical data for the same period. The match 



Figure 16.6 Validation of Model M03001. 



Figure 16.7 Forecasts of Model M03001 until December 1988. 



















292 


MATHEMATICAL MODELLING 


between the two is fairly good. New parameter estimates based on all 230 
data points (as opposed to the first 200 data points only) are as follows. 

oil = — 0.3263, ot 2 = - 0.1093, a 3 =-0.3230, 
b x =0.8008 and <r 1 2 3 4 5 6 7 = 0.0083. 

Using these estimates the forecasts for the period 231 ^ i ^ 270 are given in 
Figure 16.7. 

8 Conclusions 

In this chapter we have examined a “black-box” approach to modelling, where 
the modelling is data-dependent. We used an ARM A formulation to model the 
sequence obtained after removing the linear trend and seasonality in the 
original data. An alternative approach would have been to model a suitable 
transformation of the data by an ARMA formulation - e.g. logarithmic 
transformation of the original data as opposed to the data themselves. 
Whether such models perform better than the model developed here can only 
be established by building such models and testing them against that built in 
this chapter. The main advantage of the model structure used here is that it 
allows the use of a standard computer package to carry out model building, 
validation and forecasting. 

Notes and References 

The historical data used in this case study were obtained from [1]. 

Further details of the modelling process can be found in any one of the large 
number of books on time-series modelling; [2]-[6] give a small sample. In 
particular [2] is a classic, and it discusses the various statistical tests 
mentioned in this chapter. 

We have confined ourselves to linear models; [7]-[9] deal with non-linear 
models and models with stochastic coefficients. 

We used the time series package from [10]; similar packages can be found 
elsewhere - e.g. [11] and [12]. 
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CHAPTER 17 


Modelling Exercises 


1 Introduction 

As mentioned in the Preface, mathematical modelling is best learnt by building 
models and learning through progressive improvements in the iterative 
process. This chapter contains a variety of real-world problems for use as 
exercises in model building. 

The problems have been grouped into four categories: biological sciences, 
physical sciences, social sciences, and engineering and technology. This is done 
primarily to help the student choose a problem from a discipline of interest. 

For each problem we give a brief description and a short list of relevant 
references. These references contain information to help understand the 
physical aspects of the problem and a discussion of models built in the past. 
An ideal format for carrying out the modelling exercise is as follows: 

Step 1: Select a problem of interest. 

Step 2: Based on prior knowledge and, if needed, additional information 
obtained from relevant books (for example, by consulting a subject 
catalogue in the library) carry out a system characterization and build a 
mathematical model. 

Step 3: Carry out a detailed search using one or more abstracting or indexing 
journals (for a list of journals, see Chapter 18) to locate models built in the 
past. For most problems listed in this chapter, a proper search should yield 
more than one model. 

Step 4: Compare critically the model in Step 2 with those from Step 3, 
considering aspects such as model validity, model complexity, data 
availability, etc. 

Step 5: Revise the model in Step 2 in light of the experience gained in Step 4. 

The importance of this sequence is that it forces the student to think deeply 
about the problem and make an attempt without the thoughts being coloured 
by the attempts of earlier model builders. However, the earlier models play an 
essential role as comparators to test ideas and to guide the learning 
process. 
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2 Biological Sciences 

2.1 Cancer Growth 

The development of cancer involves two phases - transformation and growth. 
In the transformation phase a normal cell changes to an abnormal cell capable 
of generating a tumour. In the growth phase the abnormal cell multiplies by 
cell division at a rate higher than that for a normal cell, thus creating tumour 
growth. In the case of a malignant tumour the growth is so invasive that it can 
kill the host. The growth of a tumour can be controlled by various means e.g. 
radiation or chemotherapy. The problem facing medical researchers is to 
compare the effectiveness of different policies and to find the most effective 
policy to control cancer growth. Build a mathematical model to describe 
growth with and without control action. 


References 

[1] Eisen, M. M.: Mathematical Models in Cell Biology and Cancer Chemotherapy. Springer- 
Verlag, New York, 1979. 

[2] Steel, G. G.: Cell Population Kinetics in Relation to the Growth and Treatment of Cancer. 
Oxford University Press. Oxford, 1977. 

[3] Bertuzzi, A. et al. : Mathematical models of the cell cycle with a view to tumor studies. Math. 
Bio. Sci. , 53(1981), 159-88. 

[4] Segel, L. A.: Modeling Dynamic Phenomena in Molecular and Cellular Biology. Cambridge 
University Press, London, 1984. 

[5] Whittemore, A. and Keller, J. B.: Quantitative theories of carcinogenesis. SIAM Rev., 20 
(1978), 1-30. 


2.2 Human Ear 

The human ear essentially transforms sound or acoustical signals into 
electrical impulses for transmission to the brain for processing. Sometimes, 
due to physiological defects, the ear is incapable of carrying out its normal 
operations. Often the problem can be rectified through surgical means but in 
some cases this method does not work. An alternative method for helping such 
people is through an electromechanical device which in essence is an artificial 
ear. The problem facing bioengineers is to build an artificial ear whose 
dynamic behaviour is very similar to that of a normal ear. Build a 
mathematical model of the dynamics of the human ear to help bioengineers 
when designing an effective artificial ear. 


References 

[1] Tobias, J. V.: Foundations of Modern Auditory Theory , vols I and II. Academic Press, New 
York, 1970-2. 

[2] Moller, A. R. (ed.): Basic Mechanisms in Hearing. Academic Press, New York. 1973. 
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[3] Inselberg, A.: Cochlear dynamics: the evolution of a mathematical model, SIAM Rev.. 20 
(1978), 301-51. 

[4] Schroeder, M. R.: Models of hearing, Proc. IEEE. 63 (1975), 1332-50. 


2.3 Human Joint 

Human motion (e.g. walking, liftingjumping) involves dynamic action at one 
or more joints in the body. If the joints function normally they cause no 
discomfort. In many instances a joint does not function normally and causes 
severe pain when in motion. Typical situations are: joints getting worn with 
age, the onset of arthritis, joint damage due to an accident, and so on. One way 
of reducing the pain and discomfort is to replace the defective joint with an 
artificial one. For the artificial joint to be effective it must be capable of 
executing all the motions of a normal joint. 

Consider a specific joint - e.g. the hip joint. Build a mathematical model to 
describe the dynamics of the joint so that it is useful to bioengineers when 
designing hip replacements. 


References 

[1] Ghista. D. N. and Roaf, R. (eds): Orthopaedic Mechanics: Procedures and Devices. Academic 
Press, London, 1978. 

[2] Engin, A. E. and Mocinzadeh, M. H.: Dynamic modelling of human articulating joints. Math. 
Modelling. 4 (m3), 117-41. 

[3] Marshall, E. A.: A dynamical model for the stride in human walking. Math. Modelling , 4 
(1983), 391-415. 

[4] Pierrynowski, M. R. and Morrison, J. B.: A physiological model for the evaluation of 
muscular forces in human locomotion: theoretical aspects. Math. Bio. Sci., 75(1985), 69-101. 


2.4 Renewable Resource 

A wide variety of sea life (e.g. fish, crabs, lobsters) are consumed by humans. If 
the rate of harvesting is higher than the rate of natural increase, the sea life 
population will decrease and, in the worst case, the species can become extinct. 
Various international agencies are concerned with this issue and the problem 
facing them is to decide whether a particular species is to be harvested or not 
and, if harvesting is to be allowed, what limits should be placed on it. Build a 
mathematical model which will help the agencies in making sensible decisions. 


References 

[1] Clark, C. W.: Mathematical Bioeconomics: the Optimal Management of Renewal Resources. 
Wiley, New York, 1976. 

[2] Cushing, D. H.: Fisheries Biology: A Study in Population Dynamics 2nd edn. University of 
Wisconsin Press, Madison, Wl, 1981. 

[3] Haley, K. B. (ed.): Applied Operations Research in Fishing. Plenum Press, New York, 1981. 
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[4] Gerking, S. D.: Ecology of Freshwater Fish Production. Halsted Press/Wiley. New York, 1978. 

[5] Clark, C. W.: Mathematical models in the economics of renewable resources. SIAM Rev., 21 
(1979), 81-99. 


2.5 Hospital Operation 

In most Western countries medical costs are increasing at a very fast rate. There 
is a strong pressure on hospital administrators to reduce operating costs. 
Large hospitals provide a variety of medical services to a community. One way 
of reducing hospital costs is to optimize the allocation of resources (e.g. beds, 
doctors, nurses) to types of activities (surgical, coronary, pediatrics, etc.) which 
the hospital must provide. If the allocation is not optimal some of the resources 
are under-utilized while others are over-strained. In either case the return on 
investment is not optimal. Build a mathematical model which will help the 
hospital administrators reduce the cost of operating hospitals. 

References 

[ 1 ] Ackerman, E. and Gatewood, L. C\: Mathematical Models in the Health Sciences: A Computer 
Aided Approach. University of Minnesota Press, Minneapolis, MN, 1979. 

[2] Fries. B. E.: Bibliography of operations research in health care systems. Oper. Res., 24 (1976), 
801-14. 

[3] Pierskalla, W. P.: Some mathematical models in health planning. In Operations Research. 
Mathematics and Models , S. I. Gass (ed.), AMS, Providence, RI, 1981, pp. 105-50. 

[4] Ashford, J. R.: Resource allocation in health services management. Bull. IMA, 12 (1976), 47- 
55. 


2.6 Blood Circulation 

Proper flow of blood is essential to transmit oxygen and other nutrients to 
various parts of the body in humans as well as in all other animals. When 
blood flow is restricted it can cause damage ranging from minor discomfort to 
death, in the worst case. The flow of blood is influenced by the physiological 
characteristics of blood vessels - e.g. elastic properties of the wall material, 
deposition on wall surfaces and so on. These physiological characteristics 
change with age, eating habits and many other factors. A problem of interest to 
the medical profession is the relationship between blood flow and physiolog¬ 
ical characteristics of a blood vessel. Build a mathematical model to 
characterize this relationship. 


References 

[1] Bergel, D. H.: Cardiovascular Fluid Dynamics. Academic Press, London, 1972. 

[2] Lighthill, M. J.: Mathematical Biofluid Dynamics. SIAM, Philadelphia, PA, 1975. 

[3] McDonald, D. A.: Blood Flow in Arteries , 2nd edn. Edward Arnold. London, 1974. 

[4] Lieberstein, H. M.: Mathematical Physiology: Blood Flow and Electrically Active Cells. 
Elsevier, New York, 1973. 
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2.7 Man-Machine Systems 

Many complex machines involve strong interaction between the machine and 
its human operator; for example, a helicopter and its pilot. For the proper 
design of such machines the machine and its operator must be viewed as a total 
system, with due account being given to interaction between the two. As such, 
models of a human operator are essential in the proper design of machines. For 
a specific machine, build a mathematical model for a human operator useful in 
the design of that machine. 


References 

[1] Siegel, A. I. and Wolf, J. J.: Man-Machine Simulation Models: Psychosocial and Physiolog¬ 
ical Performance Interaction. Wiley-Interscience. New York, 1969. 

[2] Fitts, P. M. and Posner, M. I.: Human Performance. Greenwood Press. Westport, CT, 1979. 

[3] Johannsen, G. and Rouse, W. B.: Mathematical concepts for modeling human behaviour in 
complex man-machine systems. Human Factors, 21 (1979), 733-47. 

[4] Funk, K. H.: Theories, models, and human-machine systems. Math. Modelling. 4(1983), 567- 
87. 


2.8 Drug Absorption 

Drugs are administered either to fight harmful bacteria responsible for 
infections or to compensate for imbalances due to some organ malfunction 
(e.g. insulin in the case cf diabetics). A problem facing doctors is deciding the 
quantity and the frequency of drug administration. If the drug concentration 
in the human body is too low, the drug is ineffective. On the other hand, if it is 
too high, it can cause serious side reactions and even death. Build a 
mathematical model for drug absorption in a human body so that it can be 
used to determine optimal drug administration policies. 


References 

[1] Saunders, L.: The Absorption and Distribution of Drugs. Bailliere. Tindall. London. 1974. 

[2] Lambrecht, R. M. and Rescigno, A. (ed.): Tracer Kinetics and Physiologic Modelling: Theory 
to Practice. Springer-Verlag, Berlin, 1983. 

[3] Berger, J. et al.: Mathematical Models in Medicine. Springer-Verlag, Berlin, 1976. 

[4] Gladtke, E. and von Hattingberg, H. M.: Pharmacokinetics: An Introduction. Springer- 
Verlag, New York. 1979. 


2.9 Genetics 

Certain human features (e.g. colour of eye, susceptibility to certain diseases) 
are inherited genetically. Of particular interest is haemophilia - the case where 
bleeding does not stop due to a body’s inability to produce clotting agents. 
Both males and females suffer from this disease but it is only the female who is 
capable of genetically transmitting this defect. In other words, a male suffering 
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from this defect cannot genetically transmit it to his offspring. Build a 
mathematical model to predict the spread of this disease in successive 
generations given the fraction of males and females suffering from it at a 
particular point of time. 


References 

[1] Crosby, J. L.: Computer Simulation in Genetics. Wiley, London, 1973. 

[2] Ewens, W. J.: Mathematical Population Genetics. Springer-Verlag, Berlin, 1979. 

[3] Edwards, A. W. F.: The Foundations of Mathematical Genetics . Cambridge University Press. 
Cambridge, 1977. 

[4] Maruyama, T.: Stochastic Problems in Population Genetics. Springer-Verlag. Berlin. 1977. 


2.10 Plant Growth 

Plants and trees are a renewable source for both humans and animals. We 
shall confine our attention to trees as a source of woodchips for paper 
manufacture. When the trees are logged, young seedlings are planted to 
produce the next generation for harvesting at a later date. The operators of 
forests and plantations are interested in plants which grow rapidly, so that 
they can be harvested more frequently. Build a mathematical model to 
describe plant growth. Such a model is of use in devising optimal harvesting 
policies to maximize woodchip yield. 


References 

[1] Black, M. and Edelman, J.: Plant Growth. Heinemann Educational, London, 1970. 

[2] Thornley, J. H. M.: Mathematical Models in Plant Physiology: Quantitative Approach to 
Problems in Plant and Crop Physiology. Academic Press, London, 1976. 

[3] Causton, D. R. and Venus. J. C.: The Biometry of Plant Growth. Arnold, London, 1981. 

[4] Evans, G. C.: The Quantitative Analysis of Plant Growth. Blackwell Scientific Publications, 
Oxford. 1972. 


3 Engineering and Technology 

3.1 Internal Combustion Engine 

The power source in a gasoline driven car is an internal combustion engine 
(ICE) which converts the chemical energy of gasoline into mechanical energy 
by the process of combustion. The efficiency of the engine and the pollution 
generated depends critically on the combustion process. The design of an 
efficient engine requires a good understanding of the combustion process 
inside the ICE. Build a mathematical model of the combustion process so that 
it is of use to design engineers when designing better engines. 
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References 

[1] Mattavi, J. N. and Amann, C. A. (eds): Combustion Modeling in Reciprocating Engines. 
Plenum Press, New York, 1980. 

[2] Hucknall. D. J.: Chemistry of Hydrocarbon Combustion. Chapman & Hall, London, 1985. 

[3] Benson, R. S.: The Thermodynamics and Gas Dynamics ofl.C.E. (edited by J. H. Horlock and 
D. E. Winterbone). Oxford University Press, New York, 1982. 

[4] Carpenter, M. H. and Ramos, J. L. Mathematical modelling of spark-ignition engines. Appl. 
Math. Modelling , 9 (1985) 40-52. 


3.2 Traffic Flow 

Traffic flow in the heart of a big city depends critically on the traffic control 
scheme. The scheme comprises: (i) the traffic light cycle at each intersection, 
and (ii) the synchronization between traffic lights at different intersections. The 
traffic light cycle consists of a sequence of red, amber and green signals of either 
fixed or variable duration. If the control scheme is poor, long lines can result at 
one or more of the intersections increasing the time to travel across the city. 
The problem facing the traffic engineer is to evolve a scheme which minimizes 
the expected time to travel across the city. Build a mathematical model to solve 
this problem. 


References 

[1] Ashton. W. D.: The Theory of Road Traffic Flow. Methuen, London, 1966. 

[2] Haight, F. A.: Mathematical Theories of Traffic Flow. Academic Press, New York, 1963. 

[3] Smith, W. L. and Wilkinson, W. E. (eds): Symposium on Congestion Theory. University of 
North Carolina Press, Chapel Hill, NC, 1965. 

[4] Drew, D. R.: Traffic Flow Theory and Control. McGraw-Hill, New York, 1968. 


3.3 Technology Substitution 

One of the characteristics of modern industrial societies is that old products 
are constantly being replaced by new ones. There are various reasons for this - 
products having a finite lifetime, technological changes causing the old 
product to become obsolete, etc. The process of an old product (e.g. black and 
white television) being replaced by a new product (e.g. colour television) over 
time is called “technology substitution”. 

Technology substitution is important to the manufacturers of new products 
when planning production to meet the expected demand. Build a mathemat¬ 
ical model which can be used to forecast technology substitution. 


References 

[1] Hurter. A. P. and Rubcnstein, A. H.: Market penetration by new innovation: the technical 
literature. Tech. For. Soc. Change , II (1978). 197-221. 
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[2] Midgley, D. F.: Innovation and New Product Marketing. Croom Helm, London, 1977. 

[3] Brown, L. A.: Innovation Diffusion; A New Perspective. Methuen, London, 1981. 

[4] Martino, J. P.: Technological Forecasting for Decisionmaking. Elsevier, New York, 1972. 


3.4 Power System Dynamics 

A power system is a collection of a number of power-generating units 
connected as a network to supply demand at different points in the network. 
Whenever a sudden change occurs (e.g. due to a short-circuit) the stability of 
the system is affected. If proper action is not initiated within a reasonable time 
the system can fail catastrophically, resulting in major blackouts. A problem of 
interest to power engineers is the dynamic behaviour of the system for different 
types of faults (or sudden changes) under different control actions. Build a 
mathematical model to help the engineer study the problem. 


References 

[1] Wilson, G. L. and Zarakas. P.: Anatomy of a blackout. IEEE Spectrum , 15 (1978), 38 46. 

[2] Anderson. P. M. and Fouad, A. A.: Power System Control and Stability. Iowa University 
Press, Ames, IO. 1977. 

[3] Gross, C. A.: Power System Analysis. Wiley, New York, 1979. 

[4] Usoro. P. B. et at.: Power system modelling for emergency state simulation. Math. Modelling , 
4(1983), 143-65. 


3.5 Research and Development Management 

Research and development (R&D) is an important activity in any modern 
industrial organization. The long-term viability of the organization depends 
on successful R&D activities. The R&D manager is faced with the difficult 
problem of allocating limited resources (money, manpower, space, etc.) 
between a number of competing projects. Since project outcome (i.e. success or 
failure) and potential commercial gain should the project be a success are 
uncertain, the problem of allocation becomes more difficult. Build a mathe¬ 
matical model which will help the R&D manager to make right decisions. 


References 

[1J Fusfeld. H. I. and Langlois, R. N.: Understanding R&D Productivity. Pergamon Press, 
Oxford, 1982. 

[2] Norris, K. and Vaizey, J.: The Economics of Research and Technology. George Allen & Unwin, 
London, 1973. 

[3J Green. K. and Morphet, C.: Research and Technology as Economic Activities. Buttcrworths, 
London. 1977. 

[4] Kamicn, M. I. and Schwartz. N. L.: Market Structure and Innovation. Cambridge University 
Press. Cambridge. 1982. 
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3.6 Product Warranty 

Warranty is a contractual obligation which requires the manufacturer to 
either repair or replace a product free of cost should the product fail within a 
pre-specified period (called the warranty period) subsequent to a sale. 
Warranty is sometimes imposed on the manufacturer to protect consumer 
interests, but often manufacturers offer it as a means to advertise product 
quality. Warranty servicing costs money and hence affects profitability. 
Manufacturers would like to have estimates of this cost as a function of 
warranty period before they make decisions on the warranty to be offered. 
Build a mathematical model which will help solve this problem. 


References 

[1] Blcuel. W. H. and Bender. H. E.: Product Service Planning: Service Marketing-Engineering 

\ t l t Q^ actlons ' AMACOM, a division of the American Manufacturers Association, New York, 
1980. 

^ £ en 3«(.S4)^r;[ thy ' D N P " C0St ana,ysis of warranl y policies. Nav. Res. Log. 

[3] Nguyen D. G and Murthy, D. N. P.: A general model for estimating warranty costs for 
repairable products. Trans. At IE. 16 (1984), 379-86. 

^ £uu™ ei 29 (^98^°345 an 5 a 6 ySiS ° f ^ ^ ^ rep,acemenl warranties. Nav. Res. Log. 


3.7 Quality Control 

In multi-stage production systems the product quality at the output of the final 
stage is a function of the input materials and the condition of the machines. 
Poor quality affects sales and hence the revenue generated. Quality can be 
controlled by proper inspection of the product during the intermediate stages 
of its production and by effective maintenance of the machines. These cost 
money. Hence, a sensible scheme must be employed to maximize the total 
profits. Build a mathematical model which will yield the optimal inspection 
and maintenance policies to maximize the total profits. 


References 

[1] Sarkadi, K. and Vincze, L: Mathematical Methods of Statistical Quality Control. Academic 
Press, New York, 1974. 

[2] Burr, I. W.: Statistical Quality Control Methods. Marcel Dekker, New York, 1976. 

[3] Guenther, W. C.: Sampling Inspection in Statistical Quality Control. GrifTm, London, 1977. 

[4] Eppen, G. D. and Hurst, E. G.: Optimal location of inspection stations in a multistage 
production process. Man. Sci ., 20 (1974), 1194-1200. 


3.8 Process Control 

In a chemical process the output quality and yield depends critically on the 
levels assigned to relevant factors (temperature, pressure, concentration, etc.). 
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It is important to optimally select these levels as well as to monitor and control 
the system to ensure that they stay at the desired levels. Build a mathematical 
model to help achieve this. 


References 

[1] Ray, W. H.: Advanced Process Control. McGraw-Hill, New York, 1981. 

[2] Stewart. W. E. et al .: Dynamics and Modelling of Reactive Systems. Academic Press, New 
York, 1980. 

[3] Sargent, R. W. H.: Advances in modelling and analysis of chemical process systems. Comp. 
Chem. Eng. ( G.B. ), 7 (1983), 219-37. 

[4] Himmclbiau, D. M.: Process Analysis by Statistical Methods. Wiley, New York, 1970. 


3.9 Soft ware Reliability 

Software reliability is important to computer manufacturers as well as users. 
The lack of reliability arises due to hidden faults (also called ‘"bugs”) which 
affect computer performance. The reliability is improved through testing to 
detect faults and by rectification of the detected faults. This is a costly process 
in terms of time and money. Build a mathematical model to describe the 
occurrence of faults and their discovery. Such a model is of use in making 
decisions with respect to the effort to be spent in testing. 


References 

[1] Sauer, C. H. and Chandy, K. M.: Computer Systems Performance Modeling. Prentice-Hall, 
Englewood Cliffs, NJ, 1981. 

[2] Walker, M. G.: Managing Software Reliability: the Paradigmatic Approach. North-Holland, 
Amsterdam, 1981. 

[3] Dunn, R. H.: Software Defect Removal. McGraw-Hill, New York, 1984. 

[4] Shanthikumar. J. G.: Software reliability models: a review. Micro-Electronics Rev. % 23 (1983), 
903-43. 


3.10 Image Modelling 

Biomedical images are used to detect medical disorders and problems. In the 
past the images were processed by doctors through visual examination. With 
the rapid developments in digital computers there is a trend towards the use of 
computers for image processing and diagnosis. For the computer to diagnose 
correctly with high reliability requires correct image models. Build a 
mathematical model for processing X-ray images to detect a specific medical 
problem. 

References 

[1] Sklansky, J. and Bisconte, J. C. (eds): Bio-Medical Images and Computers. Springer-Verlag, 
Berlin, 1982. 
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[2] Preston, K. and Uhr, L. (eds): Multicomputers and Image Processing: Algorithms and 
Programs. Academic Press, New York, 1982. 

[3] Rosenfeld, A. (ed.): Image Modelling. Academic Press, New York. 1981. 

[4] Jain, A. K.: Advances in mathematical models for image processing, Proc. IEEE . 69 (1981) 
502-28. 


4 Physical Sciences 


4.1 Ocean Waves 

Exploration of natural resources under the ocean floor requires use of offshore 
structures such as platforms. These structures are subjected to constant 
beating by ocean waves. Hence the design of such structures requires the 
modelling of the dynamic loading effects of wave action. Build a mathematical 
model to describe the wave action on fixed structures so that it can be used in 
the proper design of offshore structures. 


References 

[1] LeBlond. P. H. and Mysak, L. A.: Waves in the Ocean. Elsevier, Amsterdam, 1978. 

[2] Kinsman, B.: Wind Waves: Their Generation and Propagation on the Ocean Surface. Prentice- 
Hall. Englewood Cliffs, NJ, 1965. 

[3] LeBlond. P. H. and Mysak. L. A.: Ocean waves: a survey of some recent results. SIAM Rev., 
21 (1979). 289-328. 

[4] Meyers, R. E.: Waves on Beaches and Resulting Sediment Transport. Academic Press, New 
York. 1972. 


4.2 Sand Dunes 

The formation of sand dunes and their encroachment into de-forested lands 
near deserts has become a serious problem in many parts of the world. The 
problem of interest is the movement of sand dunes. A model is needed to 
predict the spread of desert, as well as to devise policies to control the spread. 
Build a mathematical model to describe the movement of sand dunes. 


References 

[1] Bagnold, R. A.: The Physics of Blown Sand and Desert Dunes. Methuen, London, 1954. 

[2] Yalin, M. S.: Mechanics of Sediment Transport . 2nd edn. Pcrgamon Press. Oxford. 1977. 

[3] Brown, R. A.: Analytical Methods in Planetary Boundary-layer Modelling. Halsted 
Press Wiley, New York, 1974. 

[4] Cekirge, H. M. et ah Movements of sand dunes: I. Two-dimensional model. Math. Modelling , 
3(1982), 371-84. 


4.3 Water Seepage 

Dams are built across rivers to store water for multiple uses - irrigation, 
power generation, flood control, etc. Some of the water stored is lost due to 
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evaporation and seepage through the ground. Before a dam is to be built, it is 
important to estimate these losses. If the losses are significant, then the viability 
of the dam is in question. To obtain an estimate of seepage loss one must 
develop a theory for seepage. Build a mathematical model to describe the 
theory of water seepage through porous media. 

References 

[1] Rushton, K. R. and Redshaw, S. C.: Seepage and Groundwater Flow: Numerical Analysis hv 
Analog and Digital Methods. Wiley, Chichester. 1979. 

[2] Verruijt, A.: Theory of Ground-Water Flow. Macmillan, New York, 1970. 

[3] Lloyd, E. H. el al. (eds): The Mathematics of Hydrology and Water Resources. Academic 
Press, New York, 1979. 

[4] Bear, Dynamics of Fluids in Porous Media. Elsevier, New York, 1972. 


4.4 Atmospheric Pollution 

Most industrial processes emit various chemicals into the atmosphere. These 
get transmitted over large distances and often affect the ecology when they 
return to ground in the form of acid rain or other undesirable fallout. As such, 
many regulating authorities require an ecological evaluation of the potential 
damage that can be caused under the worst scenario before permission is 
granted to build large industrial plants. Build a mathematical model which 
will predict the dispersion of pollutants in the atmosphere and so be useful in 
evaluating potential ecological damage. 

References 

[1] Crank, J.: The Mathematics of Diffusion. Clarendon Press, Oxford, 1956. 

[2] Weber, E. (ed.>: Air Pollution: Assessment Methodology and Modelling. Plenum Press. New 
York, 1982. 

[3] Brebbia, C. A. (ed.): Mathematical Models for Environmental Problems. Pentech Press. 
London. 1976. 

[4] Hidy,G. M.and Brock, J.: The Dynamics of Aero-Colloidal Systems. Pergamon Press. Oxford. 
1971. 

4.5 Forest Fires 

A forest fire occurs when dry vegetation in a forest burns in an uncontrolled 
manner. It is a natural phenomenon initiated by a lightning strike or some 
other ignition event. The burning and spread of forest fires is a complicated 
phenomenon, an understanding of which will help in fighting such fires should 
they occur close to human settlements. Build a mathematical model to 
describe the burning and the spread of forest fires. 

References 

[1] Strehlow, R. A.: Combustion Fundamentals. McGraw-Hill, New York, 1984. 

[2] Kanury, A. M.: Introduction to Combustion Phenomena (for fire, incineration, pollution and 
energy applications). Gordon & Breach, New York, 1975. 
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[3] Spalding, D. B.: Mathematical models of turbulent names: a review, Comb. Sci. Tech IT 
(1976), 3-25. 

[4] Zel dovich, Ya. B. et al.: The Mathematical Theory of Combustion and Explosions. Consult¬ 
ants Bureau, New York, 1985. 


4.6 Oil Spillage 

Oil spillage in rivers and oceans is a matter of concern to environmental 
authorities as it can lead to serious ecological damage as well as severe 
economic losses. A model which can predict the movements of a patch of oil is 
of use to authorities to plan strategies to minimize the damage caused by the 
spill. Build a mathematical model which can predict the movement of an oil 
spill in a large bay. 


References 

[1] Nihoul, J. C. (ed.): Modelling of Marine Systems. Elsevier, New York, 1975. 
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4.7 Oil Extraction 

The efficient extraction of oil from underground reservoirs is of great 
importance due to the ever-increasing demand for oil. In order to develop 
efficient methods, one needs a model of the behaviour of hydrocarbon 
reservoirs. Build a mathematical model which will help solve the problem. 
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4.8 Waste Water Treatment 

In disposing of waste water it is important to ensure that it does not affect the 
water supply system. This is done by treating the waste water either using 
chemicals or by some natural process. Build a mathematical model to describe 
the underlying processes involved in waste water treatment. Such a model is of 
use in devising economical and effective treatment policies for waste water. 
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4.9 Semiconductor De vices 

Semiconductor devices are used extensively in many applications. The 
analysis of electronic circuits involving such devices requires appropriate 
mathematical models of the devices. Build a mathematical model to describe 
the dynamic behaviour of semiconductor devices of a particular type. 
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4JO Fatigue Failure 

A fairly common cause for mechanical component failure is the onset and 
growth of a fatigue crack. The time to component failure (or equivalently, the 
lifetime of the component) depends on various factors, the important one 
being the mechanical loading history. A relationship between the time to 
failure with the relevant factors is useful in the design and operation of the 
components. Build a mathematical model which can be used to predict fatigue 
failure in components. 
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5.7 Advertising 

Advertising is a means by which a manufacturer can promote the product and 
improve sales and, hence, the revenue generated. However, advertising costs 
money and is worthwhile only if the cost of advertising is less than the increase 
in the sales revenue. The problem facing the manufacturer is to evaluate the 
effectiveness of different advertising policies and the selection of the optimal 
policy. Build a mathematical model to help solve this problem. 
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5.2 Perishable Item Inventory 

Retailers of perishable items (e.g. milk, fruits, vegetables) face a dilemma. If 
they have a large inventory, they might be forced to discard items not sold 
within a time period determined by the shelf-life of the item. This causes 
economic loss. On the other hand, if the inventory is low, they face the prospect 
of reduced profit due to unsatisfied customers. The problem is made 
complicated by variable ordering costs and uncertainty in demand. Build a 
mathematical model to obtain optimal ordering policies and inventory levels 
to maximize the expected profit. 
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5.3 Insurance Premiums 

The economic viability of an insurance company depends critically on its 
ability to assess risks and decide on the premium charged to cover risks. If the 
premiums are low, then payouts can exceed revenue collected and the 
company can go bankrupt. On the other hand, if they are high, the number of 
customers will go down, thus affecting profitability. The problem facing an 



312 


MATHEMATICAL MODELLING 


insurance company is the premium it should charge for different risks to 
ensure economic viability and maximize its profits. 
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5.4 Supermarket Location 

A retailer is planning to open a chain of new supermarkets in a big 
metropolitan area. The problem facing the retailer is deciding on the number 
and the location for the supermarkets. Wrong locations, and too few or too 
many, will affect the profitability of the venture. Build a mathematical model 
which will assist the retailer in making the right decision. 
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5.5 Oil Exploration 

An oil exploration company has to make a decision on the location of the next 
hole to be drilled. If the drilled hole turns out to be dry then the investment 
(cost of drilling) is a total loss. If the hole yields gas it is a partial success, and if 
the outcome is oil it is a total success. The company owns leases at many 
different locations. The decision problem is made difficult because of the 
uncertainty in the outcome, as well as due to the fact that only imperfect 
information can be obtained by doing some testing. Build a mathematical 
model which will help the company in making the right decision to maximize 
its expected profits. 
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5.6 National Economy 

The state of a national economy is defined through aggregate variables such as 
GNP, interest rates, inflation, trade balance, unemployment, etc. These 
variables interact in a dynamic way. Thus an action to influence one variable 
can lead to undesirable changes in one or more of the remaining variables. As a 
result models which can predict the consequences of different actions would be 
of immense help both to economists and politicians. Build a mathematical 
model which will predict the effect of different control actions. 
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5.7 Brand Switching 

A shopkeeper stocks a wide variety of cigarette brands and is constantly faced 
with the problem of running short of certain brands and being left with large 
inventories of other brands due to customers changing their choice of brands. 
As a consequence, the profits of the shopkeeper are reduced. Build a 
mathematical model which will help the shopkeeper optimize on his ordering 
policies, taking into account the random nature of brand switching. 
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5.8 Learning Behaviour 

One way in which young children can be encouraged to learn is by a process of 
reward when there is positive response, and no reward (or sometimes 
punishment) with negative response. A problem of interest to educators is to 
devise schemes that will improve the rate of learning. Build a mathematical 
model which can be used to evaluate different schemes before they are tried on 
the children. 
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5.9 Resource Depletion 

One of the serious problems facing mankind is that, with the increasing 
population and the rate of consumption, there is a possibility of running out of 
various natural resources in a fairly short time interval. Economic planners 
need estimates for the time intervals for the exhaustion of different resources 
under various policies for consumption. This information is also useful in 
initiating conservation schemes and initiating research into alternatives. Build 
a mathematical model which will yield the desired answers. 
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5.10 Population Migration 

In all countries there is a constant migration of people from one region to 
another. The reasons for such migration are many. A problem of interest to 
planners is the prediction of population migration. Build a mathematical 
model to obtain predictions of population migration. 
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CHAPTER 18 


Reference Material 


1 Introduction 

As indicated in the Preface, the list of books on mathematical models and 
modelling is large. In Section 2 we give a comprehensive list of such books, 
along with a comparative evaluation and annotation so that they can be used 
as sources for additional material to supplement the text. In Section 3 we give 
a list of journal articles on models, modelling and teaching modelling, 
primarily for the instructor's use. The articles are arranged alphabetically by 
author. We conclude by giving a list of abstracting journals on mathematical 
modelling, which the student will find useful when model building. 

2 Books on Models and Modelling* 

2.1 Framework for Comparative Evaluation 

As seen from Part I, mathematical modelling requires an understanding of 
many different topics. Effective teaching of mathematical modelling requires 
that the student be exposed to all of these topics. Broadly speaking the 
different topics needed for modelling closely follow the chapters in this book 
and are as follows: 

A. Models and model building 

B. System characterization 

C. Different mathematical formulations 

D. Analysis of mathematical formulations 

E. Parameter estimation 

F. Design of experiment 

G. Validation 


•This section is based on: A comparative evaluation of books on mathematical modelling Math 
Modelling, 9 (1987), 17-28. 
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For the purpose of comparative evaluation, we divide topics C and D into 
further subtopics as follows: 

Cl. Static formulations (deterministic/probabilistic) 

C2. Dynamic deterministic formulations 
C3. Dynamic stochastic formulations 

and 

Dl. Analytical methods 
D2. Computational methods 
D3. Simulation methods 

(Many of these can be further subdivided, e.g. C2 can be subdivided into 
difference equations, ordinary differential equations, partial differential equ¬ 
ations and so on, but the subdivision shown above is sufficient for the review 
process.) 

We include a further topic to show the totality of the modelling process. 

H. Application/Case studies 

and this is subdivided into four subtopics as follows: 

H1. Physical sciences 
H2. Biological sciences 
H3. Social sciences 
H4. Technology 

As a result we have a total of sixteen topics/subtopics, and these form the basis 
for the comparative evaluation. 

2.2 Comparative Evaluation 

Table 18.1 gives the topics/subtopics covered by each of the books included 
for evaluation. This is indicated by an x in the appropriate column. A few 
comments are appropriate at this stage: 

(i) If a book concentrates on one method of analysis we classify the book 
as covering that particular method of analysis and ignoring other 
methods. 

(ii) If a book concentrates on some types of formulations more than others, 
then it is classified as examining only those types covered in greater detail. 

(iii) For topics, E, F and G we use the following convention: a book covers a 
topic if either a chapter is devoted to it or if the topic is stressed 
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Table 18.1 Comparative Evaluation of Books on Mathematical Modelling 

Authors A B Cl C2 C3 Dl D2 D3 E F G HI H2 H3 H4 


Ahuja and Schachter 

X 

X 

X 

X 


X 

X 

X 


X 

X 

X 

X 

X 

X 

Andrews and McLone 

X 

X 

X 

X 

X 

X 






X 

X 

X 

X 

Aris 

X 

X 


X 


X 









X 

Avula ex al. 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

Bender 

X 

X 

X 

X 


X 


X 








Boyce 


X 

X 

X 

X 

X 






X 

X 


X 

Boynton 

X 


X 



X 








X 


Bradley el al. 


X 

X 

X 


X 

X 





X 



X 

Brams ex al. 

X 

X 

X 



X 






X 

X 


X 

Braun ex al. 

X 

X 


X 


X 






X 

X 


X 

Burghes and Wood 

X 

X 

X 

X 


X 








X 


Burghes et al. 

X 

X 

X 

X 


X 






X 

X 

X 

X 

Clark 

X 

X 

X 

X 


X 

X 



X 



X 


X 

Close and Frederick 

X 

X 


X 


X 






X 


X 


Coats and Parkins 

X 

X 


X 




X 





X 

X 


Dym and Ivey 

X 

X 

X 

X 


X 






X 

X 

X 

X 

Giordano and Weir 

X 

X 

X 

X 


X 

X 

X 

X 

X 


X 

X 

X 

X 

Greenspan 


X 


X 



X 





X 




Haberman 

X 

X 


X 


X 






X 

X 


X 

Hawkes 


X 












X 


Howard 

X 

X 


X 








X 



X 

Hyvarinen 

X 

X 

X 







X 





X 

James 

X 

X 

X 

X 


X 

X 





X 

X 


X 

James and McDonald 

X 

X 

X 

X 


X 


X 





X 

X 

X 

Kemeny and Snell 

X 

X 

X 

X 

X 

X 








X 


Lancaster 

X 

X 

X 

X 


X 






X 

X 



Lucas et al. 

X 

X 

X 



X 


X 






X 

X 

Maki and Thompson 

X 

X 

X 

X 

X 

X 


X 





X 

X 


Malkewitch and Meyer 

X 

X 

X 

X 

X 

X 

X 



X 

X 



X 


Marchuk and Nisevich 

X 

X 

X 

X 


X 

X 






X 



Marcus et al. 


X 



X 

X 







X 



McClamroch 

X 

X 


X 


X 


X 







X 

Mihram 

X 


X 


X 



X 








Nicholson 


X 

X 

X 


X 

X 

X 







X 

Noble 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

Olinick 

X 

X 

X 

X 

X 

X 







X 

X 


Osborne and Watts 

X 

X 



X 



X 



X 


X 

X 

X 

Ransom 

X 

X 

X 

X 

X 

X 

X 

X 





X 



Rivett 

X 

X 

X 



X 


X 






X 


Roberts 

X 

X 

X 


X 

X 







X 

X 


Ross 





X 

X 



X 







Saaty 

X 

X 

X 

X 

X 

X 

X 

X 

X 



X 

X 

X 


Saaty and Alexander 

X 

X 

X 

X 

X 

X 






X 

X 

X 


Smith 

X 

X 

X 


X 



X 







X 

Sprict et al. 

X 

X 


X 

X 

X 

X 

X 

X 




X 


X 

Weinberg 

X 

X 

X 

X 

X 







X 

X 

X 

X 

Zeigler 

X 

X 



X 



X 





X 



Zierep 


X 


X 


X 






X 
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significantly throughout the book so that it is treated as being of 
importance in the modelling process. 

(iv) For topic H (subtopics H1-H4) the basis for classification is again the 
relative importance given to applications/case studies from each of the 
four disciplines. 

Brief descriptive comments are given on each book. 

Ahuja and Schachter (1983) This book presents formal models of spatial 
patterns. Examining the sequence of steps involved in modelling beginning 
with model conception analysis, fitting, and ending with synthesis of data, the 
book illustrates the various phases of modelling and synthesis. 

Andrews and McLone (1976) This is a collection of modelling exercises 
written by different authors. Seven of the exercises are from either physical 
sciences or technology; two from biological sciences and five from social 
sciences. In addition there is a chapter each on network flow models and role of 
the catastrophe method. The main emphasis is on model formulation. 

Aris (1978) This book concentrates mainly on the use of ordinary differential 
equation formulations (lumped models) and partial differential equation 
formulations (distributed models) in modelling systems and processes in 
chemical engineering. The emphasis is on modelling and analytical methods of 
analysis. The appendices examine three case studies. 

Avula, Kalman, Liapis, and Rodin (1984) This is a volume of papers 
presented at the Fourth International Conference on Mathematical Model¬ 
ling. It contains seven invited lectures concerning topics such as deterministic 
:ontrol, multicriterion decision making, and the use of iterative graphics. 
Other subjects covered extensively include methodology, systems theory, 
parameter estimation and system identification, optimization, stochastic 
modelling in economics, transportation, and mathematical modelling 
education. 

Bender (1978) This is a rather unusual book. It is divided into two parts. Part 
I deals mainly with models involving non-dynamic formulations. The chapter 
entitled “Arguments from Scale" is interesting. Part II deals with models 
nvolving simple ordinary differential equation formulations. The model 
building and analysis are strongly interwoven and the range of applications 
:onsidered is large and varied. 

Berry et al. (eds) (1986) This is a collection of 23 articles dealing with various 
ssues related to mathematical modelling; the teaching of mathematical 
noddling and a variety of case studies with a particular emphasis on the role 
ind use of microcomputers. 
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Berry et al. (eds) (1987) This is a collection of 25 articles grouped under two 
headings: (A) Modelling Courses and (B) Modelling and Schools. The articles 
discuss various issues related to the teaching of mathematical modelling at 
different levels and in different disciplines. 

Boyce (ed.) (1981) This is a collection of seven case studies written by 
different authors. The topics covered are herbicide resistance, elevator system, 
traffic flow, semiconductor crystal growth, railroad/information (network 
flow models), computer data communication and system security verification. 
The case study on semiconductor crystal growth (by L. O. Wilson) is especially 
interesting, for it describes the evolution of the modelling process very nicely. 

Boynton (1980) This is an introductory book that combines the logic of 
mathematics with the art of politics. It assumes a knowledge of algebra to 
study discrete time systems. Applications of the book include such special 
issues as policy outputs, voting distributions and behaviour regulation. 

Bradley, Gibson and Cross (1981) This is a collection of seven case studies to 
model systems of processes from different industries and written by different 
authors. The topics covered are: blast furnaces, sound distribution in a record 
player, hydraulic buffers, grinding of particles, moving fresh glass, motion of 
an automobile, and freezing and thawing of meat. Each case study is a result of 
group activity and often the group leader presents his modelling attempt and 
compares it with that of the group. 

Brams, Lucas and Straffin (eds) (1983) This book is a collection of 13 case 
studies written by different people. Each case study deals with the modelling 
and analysis of a topic from a political or related field. In some cases the model 
analysis is compared with the real data. Each case study ends with notes for the 
instructor. 

Braun, Coleman and Drew (eds) (1982) This book deals exclusively with 
models involving differential equation formulations. It consists of 23 chapters 
divided into five parts: Part 1 is general; Part 2 deals with modelling exercises 
involving first-order differential equation formulations; Part 3 (4) deals with 
three (four) modelling exercises involving higher-order linear differential 
equation (partial differential equation) formulations; Part 5 deals with 
modelling different facets of traffic flow, and Part 6 deals similarly with 
population interactions. 

Burghes and Wood (1979) This book deals mainly with very simple 
mathematical formulations; their analysis and application to problems in 
social science. The emphasis is on deterministic non-dynamic formulations, 
with a chapter each on difference and first- and second-order differential 
equations. 
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Burghes, Huntley and McDonald (1982) The book consists of four parts: Part 
1 deals with interesting case studies; Part 2 is a collection of 20 simple 
modelling exercises involving mainly deterministic formulations (either non¬ 
dynamic or simple ordinary differential equation formulations), in some cases 
real data is used for modelling and validation; Part 3 deals with the modelling 
process in a formal sense; and the last part contains 20 topics suitable for 
student modelling assignments. 

Clark (1976) This book discusses mathematical models applicable to the 
optimal utilization of biological resources focusing on conservation of 
productive resources rather than the preservation of natural environments. A 
general model is put forth and then in succeeding chapters mathematical 
techniques of optimal control theory and dynamical systems are introduced. 

Close and Frederick (1978) This book provides exposure to dynamic 
systems. Assuming a knowledge of differential and integral calculus the 
authors cover mechanical, electrical, electromechanical, thermal, and hydrau¬ 
lic systems. They also include five case studies drawn from a variety of systems, 
including a sociological system, a thermal system, and a velocity control 
system. 

Coats and Parkin (1977) This book concentrates on computer modelling for 
social sciences. The first five chapters deal with models and modelling. The last 
three chapters deal with three case studies. The topics for the case studies are: 
human memory; bed usage in hospitals; and economics of education. 

Dym and Ivey (1980) This book is unusual in its presentation. It consists of 
two parts: Part A deals with what the authors call “mathematical ideas of 
interest to the model builder” and contains four chapters. The chapters on 
dimensional analysis and on scaling are particularly interesting. Part B deals 
with applications and contains 10 chapters. A chapter each is devoted to 
ordinary differential equation formulation models and linear programming 
models. Two chapters deal with oscillations and there are chapters on traffic 
flow and diffraction. 

Giordano and Weir (1985) This book places emphasis on identifying 
problems and their underlying assumptions, determining appropriate data to 
be collected, constructing and selecting models, and testing and validating 
models. It provides numerous examples and applications in mathematical 
sciences, operations research, engineering, social and life sciences, and 
management science. 

Greenspan (1973) This book concentrates on the use of difference equation 
formulations to model a range of problems in dynamics and fluid mechanics. 
The emphasis is solely on the model building. 
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Haberman (1977) This book examines modelling in three areas- 
mechanical vibrations, population growth, and traffic flow. In each area a 
series of models are developed with a lot of emphasis on the analysis and 
interpretation of the model. The book is unusual in the sense that system 
understanding, modelling and analysis are strongly interwoven. 

Hawkes (ed.) (1973) This is a collection of articles dealing with trends in 
mathematical modelling in mainly social sciences. 

Howard (1979) This book deals with partial differential equation formul¬ 
ation in modelling problems in particle dynamics, fluid mechanics, cosmology, 
and application of models in aeronautical engineering. 

Hyvarinen (1970) This book deals mainly with the use of non-dynamic 
formulations (especially regression formulations) in modelling industrial 
processes. It contains a chapter on design of experiments and two chapters on 
the analysis of experimental results. 

James (1978) This book is a revised version of the Proceedings of the 
Conference on the Use of Mathematical Modeling in Water Pollution Control. 
Nineteen papers are included. Techniques of water pollution control systems 
are developed, as are models that apply to polluted environments, waste 
treatment and water resources. 

James and McDonald (1981) This book contains 15 case studies written by 
different authors. They cover a wide range of topics and the emphasis is mostly 
on model building. One or two case studies involve simulation. The models 
involve mainly non-dynamic formulations and the dynamic models discussed 
are based on simple ordinary differential equation formulations. 

Kemeny and Snell (1972) This is a reprint of an old book first published in 
1962. It deals with seven case studies from social sciences and one from 
biological science. The topics covered include: ecology, money flow, organiz¬ 
ation, scheduling, and a few others. The modelling involves both deterministic 
and stochastic formulations. 

Lancaster (1976) This book contains chapters on growth of populations, on 
population genetics, and on collision of particles. In particular, the last 
modelling exercise discusses models under different assumptions and laws. In 
addition there are chapters on linear programming and on search for 
maximum. 

Lucas, Roberts and Thrall (eds) (1983) This book contains a large number of 
modelling exercises dealing with application to economics, traffic theory, 
operations research and other fields. The models involve mainly non-dynamic 
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formulations. A few of the exercises involve computer simulation but the 
majority use analytical methods of analysis. The range of topics covered is 
fairly wide. There are two interesting case studies involving problems in power 
engineering. 

Maki and Thompson (1973) This book takes an axiomatic approach to 
model building. It discusses modelling topics from biological and social 
sciences involving both deterministic and stochastic dynamic model formul¬ 
ations. The last chapter, entitled “Practical aspects of model building”, 
discusses intuitive evaluation, statistics for the model building process and 
simulation. 

Marchuk and Nisevich (1980) This book concerns the application of 
mathematical theory to medicine. In particular it introduces a mathematical 
model of viral disease and then develops methods for quantitative assessment 
of the seriousness of viral hepatitis. These models are then used to study the 
dynamics of functional recovery. 

Marcus-Roberts and Thompson (eds) (1983) This book is a collection of 12 
case studies in biological sciences written by different authors. The case studies 
involve both deterministic and stochastic dynamic formulations and in some 
of the studies real data are used. The emphasis is on models suitable for 
analytical methods of analysis. Each chapter contains notes at the end for the 
instructor. 

McClamroch (1980) This book concentrates on models involving mainly 
ordinary differential equation formulations. Seven chapters are devoted to 
differential equation models. In each chapter several interesting case 
studies/applications are discussed. Both modelling and analysis are equally 
emphasized. Most of the case studies come from engineering, and an appendix 
deals with CSMP simulation. 

Mihram (1972) This book concentrates mainly on probabilistic non¬ 
dynamic and discrete stochastic process formulations. The emphasis is on 
computer simulation. In addition, it contains two chapters on the design of 
experiment. 

Nicholson - Vol. 2 (1980) This book is a collection of seven case studies from 
various branches of engineering. The topics for the case studies are: iron and 
steel making, electric arc furnace, turbo generator, mineral extraction, and 
manufacturing. The models involve either non-dynamic or simple differential 
equation formulations. 

Noble (1967) This book is based on examples of applications of undergradu¬ 
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ate mathematics in engineering, submitted by various university mathematics 
and engineering departments and some industrial companies. The four 
sections of the book deal with applications of elementary mathematics, 
ordinary differential equations, field problems such as flow of fluids or heat, 
the diffusion of gases, stresses and strains in solids, linear algebra, and 
probability. 

Olinick(1978) This book contains 14 chapters, with 10 devoted to modelling 
specific topics. In each modelling chapter often more than one model is built 
and analysed. The interesting aspect is the historical perspective for modelling 
topic. The models are mainly from biological and social sciences and involve a 
range of mathematical formulations. 

Osborne and Watts (eds) (1977) This is a collection of several articles with the 
emphasis on simulation. Six of them deal with case studies from mainly 
biological and social sciences. One chapter deals with the use of statistics in 
model validation. 

Ransom (1981) This book describes the uses and applications of models in 
development biology. A variety of model systems are discussed in the first half 
of the book. The second half concentrates on computer modelling. 

Rivett (1972) This book contains 12 chapters and a summary. It deals with 
the philosophical questions of model building first, followed by applications to 
business and economics. Investment, sequenced divisions, decision and utility 
theory and competitive problems are some of the subjects to which individual 
chapters are devoted. 

Roberts (1976) This book deals mainly with two types of formulations- 
graph theoretic and Markov chains, and their use in modelling a range of 
topics in social and biological sciences. This is an excellent book for the use of 
graph theoretic representations in system characterization. The approach is 
mainly axiomatic and the emphasis is on analytical methods. 

Ross (1983) This book discusses various stochastic dynamic formulations 
and their use in modelling. In particular, it examines Markov chains, Poisson 
processes, and renewal theory. The application of these to queues and 
reliability are discussed. The final chapter discusses parameter estimation for 
some of the formuations. 

Saaty (1973) This book consists of the notes used at a summer seminar on 
mathematical modelling. The notes are divided into three parts. Part I is a 
general discussion of mathematics and behavioural sciences with emphasis on 
model planning and building and the process of decision making. Part II 
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investigates eight topics central to mathematical modelling of social sciences: 
allocation and optimization, conflict resolution, scheduling tasks to machines, 
flow and synthesis problems in networks, inventory control, queues and 
congestions, reliability, maintenance and renewals, and search. Part III 
describes some structured and unstructured problems. Among these are the 
travelling salesman problem, arborescences and questionnaires, epidemics 
and stochastic barriers, and parking at an aircraft terminal. 

Saaty and Alexander (1981) This book is divided into three parts. Part 1 
deals with methodology; Part 2 with the “framework"; and Part 3 with 
applications. The different types of formulations considered in the book are 
given in Part 2. Each formulation is illustrated by a simple application 
involving the formulation. The chapter entitled “Equations" deals with 
different types of equation formulations, and the chapter entitled “Stochastic 
processes" looks at a range of stochastic formulations. The applications cover 
a wide range of topics and the emphasis is on model building and 
analysis. 

Sheep (1983) This is the 27th volume of the Proceedings of Symposia in 
Applied Mathematics. The subject covered, computed tomography, is discus¬ 
sed in seven papers. Each paper suggests problems, both relevant and of 
“intrinsic interest" to mathematicians. The volume addresses the mathemat¬ 
ical modelling problem of finding the appropriate level of abstraction in a 
mathematical model of a real-world situation. 

Spriet and Vansteenkiste (1982) This book examines the use of stochastic 
difference equation formulations and deterministic partial differential equ¬ 
ation formulation. Parameter estimation is discussed in great detail for 
stochastic difference equations. The numerical solutions of partial differential 
equations are discussed, along with software packages. The application of 
partial differential equations to environmental problems is discussed briefly. 
In addition, there is a chapter on hardware trends and their impact on 
simulation. 

Weinberg (1975) This is an introductory book which assumes a knowledge 
of high school algebra. The book is divided into seven chapters, each of which 
ends with a number of exercises. The book also includes an appendix on how 
to read mathematical notation. The chapters are entitled: the problem, the 
approach, system and illusion, interpreting observations, breaking down 
observations, describing behaviour, and some systems questions. 

Zeigler (1976) This book concentrates on developing a theory of modelling 
and simulation. The system characterization is done with the aim of 
stimulating the model on a computer. Various aspects of simulation are 
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discussed in great detail. Appendix A deals with a case study - Modelling the 
human brain. 

Zierep (1971) This book discusses dimensional analysis, similarity laws, and 
dimensionless parameters. It uses the study of classical mechanical and 
thermodynamic similarity laws of gas dynamics to cover subjects such as 
linearized subsonic and supersonic flow, transonic and hypersonic flow, and a 
brief survey of corresponding flows with relaxation. 
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4 Journals 


4.1 Journals on Mathematical Modelling 

[1] Mathematical and Computer Modelling 

[2] Applied Mathematical Modelling 

[3] International Journal of Modelling and Simulation 

[4] Journal of Interdisciplinary Modelling and Simulation 

[5] Stochastic Models 

The following journals contain many articles on models and modelling. 

[6] Teaching Mathematics and its Applications 

[7] Bulletin of the Institute of Mathematics and its Applications 

[8] UMAP Journal: The Journal of Undergraduate Mathematics and its 
Applications 

UMAP modules contain many articles on models and modelling. 
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4.2 Abstracting and Indexing Journals 

It is impossible to give a complete list of abstract and indexing journals as it 
would run into several hundred. Instead, we give a short list which covers 
nearly all the disciplines where mathematical models have been used for 
solving problems. 

[1] Mathematical Reviews 

[2] Statistical Theory and Method Abstracts 

[3] Biological Abstracts 

[4] Applied Ecology Abstracts 

[5] Ecology Abstracts 

[8] Population Abstracts 

[9] Entomology Abstracts 

[10] Pollution Review 

[11] Forestry Abstracts 

[12] Aquatic Sciences and Fisheries Abstracts 

[13] Tropical Diseases Bulletin 

[14] Population Index 

[15] Geo Abstracts 

[16] Sociological Abstracts 

[17] Business Periodicals Index 

[18] Index of Economic Articles 

[19] International Bibliography of Economics 

[20] Computer and Control Abstracts 

[21] Computing Reviews 

[22] The Engineering Index 

[23] Applied Mechanics Reviews 

[24] Cybernetics Abstracts 

[25] International Abstracts in Operations Research 

[26] Physics Abstracts 

[27] British Chemical and Physiological Abstracts 

[28] Psychological Abstracts 

[29] A London Bibliography of the Social Sciences 

[30] International Political Science Abstracts 

[31] Linguistic and Language Behaviour Abstracts 

[32] Bibliographic Linguistique 

[33] Bibliographic Geographique Internationale 

[34] Education Index 

[35] British Education Index 

[36] International Bibliography of the Social Sciences 
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